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Abstract 

' Given independent random points X\ , . . . , X n E M. d with common probability 

distribution v, and a positive distance r = r(n) > 0, we construct a random geometric 
■ graph G n with vertex set {1, . . . ,n} where distinct i and j are adjacent when — 

ill ri 

Xj\\ < r. Here ||.|| may be any norm on K , and f may be any probability distribution 
on M. d with a bounded density function. We consider the chromatic number x(G n ) 
of G n and its relation to the clique number u[G n ) as n — > oo. Both McDiarmid [12j 
and Penrose |16| considered the range of r when r <C and the range when 

r 3> and their results showed a dramatic difference between these two 

^ ■ cases. Here we sharpen and extend the earlier results, and in particular we consider 

. the 'phase change' range when r ~ (iiaz^i/cZ with t > a fixed constant. Both [12] 

^>>0 | and [16] asked for the behaviour of the chromatic number in this range. We determine 

constants c(t) such that X ^ G ^ — > c(t) almost surely. Further, we find a "sharp 
threshold" (except for less interesting choices of the norm when the unit ball tiles 
d-space): there is a constant to > such that if t < to then x \% n \ tends to 1 almost 



surely, but if t > to then j^rj tends to a limit > 1 almost surely 



1 Introduction and statement of results 



In this section, after giving some initial definitions including that of the random geometric 
graphs G n , we present our main results on colouring these graphs. We then introduce 
more notation and definitions so that we can specify explicit limits; we discuss fractional 
chromatic numbers and 'generalised scan statistics', which will be key tools in our proofs; 
and we sketch the overall plan of the proofs. 
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1.1 Some definitions and notation: G n , 5, a, x an d u 

To set the stage, we fix a positive integer d and a norm |.| on M. d . Given points x%, . . . , x n 
in R d and a 'threshold distance' r > 0, the corresponding geometric graph G(x\, . . . , x n ; r) 
has vertices 1, . . . , n and distinct vertices i and j are adjacent when \\xi — Xj\\ < r. It will 
be convenient sometimes (when the Xi are distinct) to consider G(x\, . . . having 
as vertices the points x^ rather than their indices i. 

Now introduce a probability distribution v with bounded density function, and consider 
a sequence X\,X% ... of independent random variables each with this distribution. Also 
we need a sequence r = (r(l), r(2), ..) of positive real numbers such that r(n) — > as 
n — > oo. The random geometric graph G n is the geometric graph G(Xi, . . . , X n ; r(n)) 
corresponding to X\, . . . ,X n and r(n). Observe that almost surely the Xi are all distinct 
and we never have \\Xi — Xj\\ = r(n); thus it would not really matter if we took the vertex 
set of G n as {X%, . . . , X n } and said that vertices Xi and Xj (for % ^ j) are adjacent when 
\\Xi-XjW <r(n). 

The distance r = r(n) plays a role similar to that of the edge-probability p(n) for Erdos- 
Renyi random graphs G(n,p). Depending on the choice of r(n), qualitatively different types 
of behaviour can be observed. The various cases are best described in terms of the quantity 
nr d , which scales with the average degree of the graph (for a precise result see appendix A 
of the paper [13] by the second author). We will often refer to the case when nr d / Inn — )■ 
as the sparse case, and the case when nr d / Inn — > oo as the dense case. 

Throughout this paper, we shall use the terminology almost surely (or a.s.) in the 
standard sense from probability theory. That is, a.s. means "with probability one", and 
if Zi, Z2, . . . are random variables and c G R a constant then Z n — » c a.s. means that 
P(Z n — y c) = 1. The notation f(n) g(n) means the same as f(n) = o(g(n)), and the 
notation f(n) ~ g(n) means that f(n)/g(n) — > 1 as n — > 00. So in particular, if Zi, Z2, . . . 
are random variables and (k n ) n is a sequence of numbers then Z n ~ k n a.s. means that 
¥(Z n /k n 1) = 1. 

Before we can state our first result we will need some further notation and definitions. 
We use a to denote the essential supremum of the probability density function / of that 
is 

a := sup{t : vol({x : f(x) > t}) > 0}. 

Here and in the rest of the paper vol(.) denotes the d- dimensional volume (Lebesgue mea- 
sure). We call a the maximum density of v. 

We also need to define the 'packing density' for the given norm ||.||. Informally this 
is the greatest proportion of ~R d that can be filled with disjoint translates of the unit ball 
B := {x G R d : ||x|| < 1}. For K > let N(K) be the maximum cardinality of a collection 
of pairwise disjoint translates of B with centers in (0, K) d . The (translational) packing 
density 8 (of the unit ball B with respect to ||.||) may be defined as 

S:= Um N ( K)jo l( B) 

K^oo K d 

This limit always exists, and < 8 < 1. In the case of the Euclidean norm in M 2 we have 
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8 = ~ 0.907. For an overview of results on packing see for example the books [H] of 
Pach and Agarwal or [TB] of Rogers. 

Recall that a k-colouring of a graph G is a map / : V(G) —> {1, . . . , k} such that 
f(v) ^ f{w) whenever vw G E(G), and that the chromatic number x{G) is the least k for 
which G admits a fc-colouring. Also a clique in G is a set of vertices which are pairwise 
adjacent, and the clique number u(G) is the largest cardinality of a clique (note that uj(G) 
is a trivial lower bound for x{G)). In this paper we are interested mainly in the behaviour 
of the chromatic number x{G n ), and its relation to the clique number u(G n ), of the random 
geometric graph G n as n grows large. 

Assumptions and notation for the random geometric graph G n . For convenience 
of reference, we collect our standard assumptions. We assume throughout that we are given 
a fixed positive integer d and a fixed norm |.| on M. d , with packing density 5. Also v is 
a probability distribution with finite maximum density o\ X 1 ,X 2 ,... are independent 
random variables each with this distribution; r = (r(l),r(2), . . .) is a sequence of positive 
reals such that r(n) — > as n — > oo; and for n — 1,2, ... , the random geometric graph G n 
is the geometric graph G(X±, . . . , X n ; r(n)). 

1.2 Main results 

Our first theorem gives quite a full picture of the different behaviours of the chromatic 
number of the random geometric graph depending on the choice of the sequence r. 

Theorem 1.1 For the random geometric graph G n as in Section fiTil the following hold. 
(i) Suppose that nr d < n~ a for some fixed a > 0. Then 



P (x(Gn) G { 



In n I _|_ 1 

„^ I T 



\n(nr d ) 



In n 



\n{nr d ) 



-, -\- - 



+ 1 j for all but finitely many rij — 1. 



(ii) Suppose that n 6 nr d <C Inn /or all e > 0. Then 

x(G n )~]nn/ln(— a.s. 
(iii) Suppose that — > t G (0, oo). T/ien 

X(C„) ~ / x (t) ■ anr d a.s., 

where f x is given by (11 01) below. It depends only on d and ||.|| ; is continuous and 
non-increasing; and satisfies f x (t) — > v ° 2 j? as t — >■ oo ; and f x {t) — > oo as t 4 0. 
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(iv) Suppose that nr d ^ Inn (but still r — » 0). Then 



x{G n ) ~ ■ o-nr d a.s. 



Part (ii) can basically already be found in Penrose's book [16], and also in [12] for 



the case when al = 2 and II. II is the euclidean norm. We have however settled the minor 



technical issue of improving the type of convergence in (ii) from convergence in probability 



to almost sure convergence, which was mentioned as an open problem in [16] and [12j . 



In part (iv) we obtain an improvement over a result in Penrose [16], where an almost 



sure upper bound for lim sup x ^") of v °^/ and an almost sure lower bound for lim inf x ^"} 

of v °2dfi are given. Here 5l is the lattice packing density of B (that is, the proportion of 
M d that can be filled with disjoint translates of B whose centres are the integer linear 
combinations of some basis for R d ). The paper [12] by the first author considers only the 
Euclidean norm in the plane, where S and 8l coincide. However, let us note that in general 
dimension the question of whether 5 = 5l is open, even for the Euclidean norm, and it 
may well be that 5 > 5l for some dimensions d. 



Part (iii) settles the open problem of a "law of large numbers for x" in this regime posed 
in [16]. Both Penrose [16] and the first author [T2] have studied the chromatic number in 
the cases when nr d <C Inn and nr d 3> Inn, but not much was known previously about 
the behaviour of the chromatic number in the "intermediate" regime when nr d = 0(lnn). 
The limiting constant f x (t) is given explicitly by ([TO]) below: since it requires an involved 
sequence of definitions we defer the precise definition until then. 

For comparison to Theorem 11.11 above, we shall now give a result on the clique number 
in the same flavour as Theorem 11.11 The results listed in Theorem 11.21 below were already 
shown by Penrose [IB] : except that there was an extra assumption that the probability 



density function of v has compact support, and in the regime considered in part (ii) 



only convergence in probability was shown and we have added some detail in the regime 



considered in part (i) 



Theorem 1.2 For the random geometric graph G n as in Section [777], the following hold. 
(i) Suppose that nr d < n~ a for some fixed a > 0. Then 



In n I I 1 

\n{nr d ) I 2 



Inn lil 



\n(nr 



n + 2 



1 \ for all but finitely many n ) = 1 



(ii) Suppose that n £ nr d <C Inn for all e > 0. Then 



u(G n ) ~ Inn/ In 



Inn 



nr u 



a.s. 
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(iii) Suppose that — > t E (0, oo). Then 



u(G n ) ~ f u (t) ■ anr d a.s. 

Here f u (t) is the unique f > vo ^ d B •* that solves H(f2 d / vol(B)) = tv ^ B ^ , where 
H(x) := xlnx— x+1 forx > 0. The function f w is continuous and strictly decreasing; 
and satisfies f u (t) v ° 2 d as t — >■ oo, and fu(i) — > oo as 1 1 0. 

(iv) Suppose that nr d ^ Inn (but still r — > 0). Then 

u{G n ) ^•(rnr" a.s. 

Comparing Theorem 1 1.1 1 to Theorem II .21 it is rather striking that when nr d <C Inn the 
chromatic and clique number have the same behaviour, while when nr d 3> In n they differ 
by a multiplicative factor (provided 5 < 1). Clearly there is some switch in behaviour 
when nr d = O(lnn). The following result shows a threshold phenomenon occurs (provided 
5 < 1). 

Theorem 1.3 The following hold for f x , f u . 
(i) If 5 = 1 then f x (t) = f u (t) for all t E (0, oo). 

(ii) If 5 < 1 then there exists a constant < to < oo such that f x (t) = fu){t) for all t <to 
and the ratio f x (t)/fu(t) is continuous and strictly increasing for t > t . 

For t E (0, oo) let us write 

Observe that, by the properties of f x and f^ listed in Theorem 11.11 and Theorem 11.21 the 
function f x / w depends only on the choice of d and ||.||, it is continuous on (0, oo) and 

lim f x/u} (t) = i. (2) 

t— >oo 

By adding a small amount of work to the proofs of Theorem 11.11 and Theorem 11.21 we will 
also show that 

Theorem 1.4 For the random geometric graph G n as in Section with r = r(n) an 
arbitrary sequence that tends to 0, the following holds. Set t(n) := anr d /\nn then 

' fx/M n )) a - s - 



' tp Inn ' 



So in particular, when S < 1, we see a sharp threshold at ro := ( /"^'" I 
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Corollary 1.5 Consider the random geometric graph G n as in Section XTTh Suppose 5 < 1 
and let to be the constant in part (ii) of Theorem \1.3l If we set r := ( f ln " ) s then the 
following hold 



(i) //limsup^oo £ < 1 toen 



X(gn) 



— > 1 a.s. 



(ii) 7/liminf^oo ^ > 1 i/ien 



hmmr > 1 a.s. 

In the course of proving Theorem II .41 we shall prove the following result, which may be 
of independent interest. It shows that for very small r the clique number and chromatic 
number are not only concentrated on the same two consecutive integers (as shown by 
parts (i) of Theorems 11.11 and 11.21) . but in fact the chromatic number and clique number 
are equal. 

Proposition 1.6 For the random geometric graph G n as in Section [777} if nr d < n~ a for 

some fixed a > then 



F(x{G n ) = u)(G n ) for all but finitely many n) 



1. 



1.3 The weighted integral £ and explicit limits 

For x > let H(x) = J^lny dy = xlnx — x + 1 as in Theorem 11.21 Observe that H(l) = 
and that the function H{x) is strictly increasing for x > 1. 

Now let ip be a fixed non- negative, bounded, measurable function with < f Rd (p(x)dx < 
oo. For s > let 

f(s) := [ H(e s ^ x) )dx. 

It is routine to check that /(0) = 0, that f(s) is continuous and strictly increasing in s, 
that f(s) < oo for all s > and that f(s) — > oo as s — > oo. For < t < oo the weighting 
value s(ip,t) is defined to be the unique nonnegative solution s to f(s) = 1/t. Observe 
that the function s(cp,t) is strictly decreasing in t, s(<p,t) — > oo as t — > and s(<p,t) 
as t — > oo. 

We define £(<p, t) for < t < oo by 

f(y>,t) := / v{x)e s ^ x) dx (3) 
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where s is the weighting value s(<p,t). So far, £,((p,t) is strictly decreasing in t. It is 
convenient also to set 



£(</?, oo) : = / p. (4) 



Further, if J p = (in which case p = almost everywhere) then set £,((p,t) = for each 
t G [0, oo], and if J p> = oo then set s(<p,t) = and t) = oo for each t G [0, oo]. We 
call £,((f,t) the weighted integral : note that the function £ depends only on the dimension 
d. 

We may identify t) when = 1^ for a measurable set C M. d with < vol(W) < 
oo. For each w > define c(w,t) for £ G (0, oo] as follows: set c(w, oo) = w, and for 
< t < oo let c(w, i) be the unique solution x > w to ) = —i. Observe that c(w,t) is 
continuous and strictly decreasing in t for t G (0, oo); and that 



c(w,t) — > oo as t — > 0, (5) 



and 



c(w, t) —)■ w as t — > oo. (6) 

For < t < oo we have £(liy,t) = e s vo\(W) where s is such that H(e s ) vo\(W) = |; and 
so 

£( W) = c(vol(WV) (7) 
and this holds also for t = oo since then both sides equal vol(W). 

Let _B(x; p) denote the ball {y : — y\\ < p}, so that B = £>(0; 1). Let us set 

<A) := 1 B(o;ly (8) 
Observe that the function f u in Theorem 11.21 satisfies 

f u (t) = c(vol(5(0; 1/2)), t) = f (^ , *). (9) 
So in particular, by the properties of c(w,t) listed above, f u satisfies the properties 



listed in part (iii) of Theorem 11.21 

Call a set S C well-spread if — w\\ > 1 for all v ^ w G S; and let S denote 
the collection of all such sets. Finally here we call a nonnegative, measurable function 
tp : M. d — y M. (dual) feasible if it satisfies the condition that ^2 ve s¥{ v ) — 1 f° r each set 
S G S. For example, tfo is feasible. Denote the set of all feasible functions by 3 r . We may 
now define the real-valued function f x on (0, oo) in Theorem ll.il by setting 

f x (t) := sup £(tp, t) for0<t<oo. (10) 
Finally note that the real-valued function f x / w on (0, oo) defined by (DQ) satisfies 

/xM*) = q for < t < oo. (11) 
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1.4 Fractional chromatic number 



We shall see that, in Theorem 11.11 the same conclusion holds if we replace x{G) by the 
fractional chromatic number Xf{G) an d indeed this is the key to the proofs. 

Recall that a stable or independent set in a graph G is a set of vertices which are pairwise 
non-adjacent, and the chromatic number x{G) °f G corresponds to a natural integer linear 
program (ILP), expressing the fact that the chromatic number is the least number of stable 
sets needed to cover the vertices, as follows. Let A be the vertex-stable set incidence matrix 
of G, that is, the rows of A are indexed by the vertices v, the columns are indexed by the 
stable sets S, and (A) V) s = 1 if v G S and (A) V) s = otherwise. Then x(G) equals 

min l T x 

subject to Ax > 1, (12) 
x > 0, x integral. 

The fractional chromatic number Xf(G) °f a graph G is the objective value of the LP- 
relaxation of ( fl2l) (that is, we drop the constraint that x be integral). 

It is easy to see that always oj(G) < Xf(G) < x(G). In general both the ratios 
Xf{G) /uj{G) and x(G)/Xf(G) can De arbitrarily large (see for example chapter 3 of 
Scheinerman and Ullman [19]), but that is not the case for geometric graphs (for a given 
norm on M. d ). For x{G) A(G) + 1 for any graph G, as shown by a natural greedy 
colouring algorithm; and, since we may cover the unit ball B with a finite number k 
of sets of diameter < 1, for any geometric graph G we have A(G) + 1 < ku(G), and so 
< Xf(G) < x(G) < ku(G). (The diameter of a set A C R d is sup{||x-y|| : x,y G A}.) 

For the random geometric graph G n the two quantities x(Gn) an d Xf{G n ) are even 
closer. Our approach to proving Theorem 11.11 and the other results will naturally yield: 

Theorem 1.7 For the random geometric graph G n as in Section \1.1\ (with any distance 
function r = r(n) = o{l)), we have x(G n ) / Xf(G n ) ~^ 1 a - s - 

1.5 Generalised scan statistics 

For a set V of points in M d and a nonnegative function tp : ~R d — > R we define M(V, if) by: 

M(V, v?) := sup — x). 

This quantity plays a central role in our analysis, as do the random variables 

n 

M„ = M v (n, r) := sup ¥ { r ~ lx i ~ x ) 

where we have scaled the Xi by r _1 . Thus if the points Xi,...,X n are distinct, and 
V = {r- 1 ^!, . . . , r^Xj then M v (n, r) = M(V, ^). 
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Feasible functions (p correspond to feasible solutions to the dual of the LP for the 
fractional chromatic number. In the special case when tp is the indicator function l\y of 
some set W C M. d , we denote M v by My/- We denote the number of indices i G {1, . . . , n} 
such that Xi E W by N(W) = X n (W); that is, N(W) = EILi WW- (We will often omit 
the argument or subscript n for the sake of readability.) Notice that Mw is the maximum 
number of points in any translate of rW; that is My/ = max x 'N(x + rW). The variable 
M w is a scan statistic (with respect to the scanning set W), see for example the book [I] 
by Glaz, Naus and Wallenstein: we call M v a generalised scan statistic. 

We say that a set W C M d has a small neighbourhood if it has finite volume and 
lim^o vol(W e ) = vol(W), where W e = W + £-6. Then for sets W with finite volume, W 
has a small neighbourhood if and only if W is bounded and vol(cl(W)) = vol(W), where 
cl(.) denotes closure. In particular all compact sets and all bounded convex sets have small 
neighbourhoods (and the choice of the norm ||.| is not relevant). We say that a function 
tp : M. d — > K. is tidy if it is measurable, bounded, nonnegative, has bounded support and 
the sets {x : <p(x) > a} have small neighbourhoods for all a > 0. 

The proofs of the above theorems rely heavily on the following limiting result concerning 
the generalised scan statistic M v for a tidy function tp. 

Theorem 1.8 Let v be a 'probability distribution on M. d with finite maximum density a; let 
X±,X2, . . . be independent random variables each with distribution v; and let r = r(n) > 
satisfy r(n) — > as n — > oo. If ip is a tidy function and if t{n) := satisfies 
liminf n t(n) > then 

— ^~£(¥>,t(n)) a.s. 
cmr a 



1.6 Plan of proofs 

In the next section, Lemma [2.11 gives basic results on the weighted integral £(</?,£) which 
we use throughout the paper. The following section, Section [31 contains the proof of 
Theorem 11.81 on generalised scan statistics, and includes some more detailed lemmas on 
this topic which will be needed later. These two sections are quite technical: they could 
be skipped on a first reading, and referred back to as needed. 



In the short Section H] we give a quick proof of parts (iii) and (iv) of Theorem 11.21 



on u)(G n ), using Theorem 11.81 (together with Lemma 12. ip . In Section [SJ we come to the 
heart of the proofs on x(G n ): we first prove some deterministic results on x an d Xf f° r 
geometric graphs, and on feasible functions tp and the weighted integral £; and then we 



deduce parts (iii) and (iv) of Theorem 11.11 on x{G n ) and Theorem 11.31 on x(G n )/o;(G n ), 
using Theorem 1 1 . 8 1 ( t o get her with Lemma [2TT]) . In the next section, Section [61 we show that 
f X i /w: /x/w an d have the properties claimed in Theorems ll.l[ 11.21 and [Ol Here, as well 
as using Lemma 12.11 and Theorem 11.81 we bring in some detailed lemmas on generalised 
scan statistics from Section [3j In Section [7] we complete our proofs, and finally we make 
some concluding remarks. 
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2 The weighted integral - basic results 



Here we collect some useful observations about the weighted integral defined in 
Section 11.31 Throughout the paper, we will usually omit the domain we are integrating 
over and simply write J ip instead of j Rd ip(x)dx. All integrals in this paper are over M. d 
(and wrt the d-dimensional Lebesgue measure) unless explicitly stated otherwise. The 
following lemma lists a number of basic properties of £(ip, t). We will make frequent use of 
these properties in the rest of the paper. 

Lemma 2.1 Let p and ip be non-negative, bounded, integrable junctions on ~R d , and let 
t e (0,oo], 

(i) Iftp<if> ihen£(tp,t)<£(i[>,t). 

(ii) f(A<p,f) = A £(<p,t) for any A > 0. 

(iii) £(<p + il>,t) <£(<p,t)+£(ip,t). 

(iv) ForO < \ < 1 let ip\ be given by <p\(x) = <p(\x). Then £(ip,t) < £(<p\,t) < \~ d !;(tp,t). 

(v) ^ £(</?, t) < £(<p, t + h) < £(<p, t) for < t < oo and h > 0. 

(vi) If J y?l{ v >a} < / ^l{v->a} for all a then £(<p,t) < £(tp,t). 
(vii) £(<p, t) — > J p = £(<p, oo) as t — > oo. 

(viii) Let tpx, tf2, . . . be non-negative, bounded, integrable functions on M. d , and suppose that 
tp n — > p pointwise as n — > oo, and p n < ip for all n. Then £(p n ,t) — > C,(p,t) as 
n — » oo. 



The case t = oo is always trivial, so in the proofs we will only consider the case when 
t < oo. On several occasions, in the proof below and later, we will differentiate an integral 
over x G M. d with respect to a parameter u and swap the order of integration. In all 
cases this can be justified by means of the fundamental theorem of calculus and Fubini's 
theorenjE A function is simple if it takes only finitely many values. We prove the parts of 
the lemma in a convenient order. 



Proof of | (vii) \ If < t < t' < oo then s(ip,t) > s(p,t') so £(ip,t) > £((p,1f). Thus |(vii) 



follows from the monotone convergence theorem. 

1 Here we mean the following. If g(x,u) denotes one of ip{x)e u ^ or H(e u eW) then f Rd g(x,u) - 
g(x, 0)dx = f Rd Jg" g2(x, w)dwdx = J™ f Rd gi{x, w)dxdw, where 172 denotes the derivative of g wrt. the sec- 
ond argument, and we have used Fubini's theorem to switch the order of integration. Now the fundamental 
theorem of calculus shows that ^ f Rd g(x, u)dx = ^ / Rd g(x, u) - g(x, 0)dx = ^ J Q " / Rd g 2 (x, w)dxdw = 
f M 4 g2(x,u)dx. 
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Proof of (i) : If we differentiate the equation t J H(e Sip ) = 1 wrt t we find: 



<p 2 e">>, 



which gives 



t 2 s f ip 2 e SL P' 

(That s is different iable can for instance be seen from the implicit function theorem^.) 
Thus 

Now notice that (p < ip implies s((p,t) > s(ip,t), so that for all < t < oo: 

^>>*)>^M), 



which implies that £(ifi,t) — £) is non-increasing. Finally, by (vii) we have 
so that £(il>, t) > £(</?, t) for all t > 0. 



Proof ofp)] We must have s(Xip,t) = s(<p,t)/\ as / H(e s{ ^) = j H{e s ^^ Xv ) = \. 



So indeed £(\<p,t) = J Xtpe"^*^ = \J ipe'^v = \£(ip,t). 



Proof of (viii)f First, let s > be fixed but otherwise arbitrary. Observe that H(e sipn ) < 



H(e s ^). Since we also have j H(e s ^) < oo (as observed in section [L3|) . the dominated 
convergence theorem gives 



lim / H{e stpn ) = / H{e stp ). 

n^oo J J 

This shows that lim^oo s(tp n ,t) = s(<p,t). Thus, for all e > and n sufficiently large: 



As J ipe^^'*^^ < oo the dominated convergence theorem also gives that 



J (peMvd-e)* < hminf £(<p n ,t) < lim sup £(</?„, t) < J 



2 Set F(t, s) := J Rd H{e slp{x ' ) )dx-\. Observe that j^F ± for all s > 0. The implicit function theorem 
now gives that for every t > there are neighbourhoods U of t and V of s((p,t) and a unique function 
g : U —>V such that F(t', g(t')) = for all t' G U\ and moreover this g is continuously differentiable. Thus 
s((p,t') = g(t') for all t' G U and in particular s(ip,t') is differentiable at t' = t. 
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Two more applications of the the dominated convergence theorem now yield 



lim / tpe 



(s(<p,t)-e)<p 



lim / ipe 

e->0 ' 



(s(<p,t)+e)<p 



giving the result. 



Proof of (iii)f By (viii) it suffices to take <p and ip as simple functions. What is more, 
we can assume without loss of generality that supp(y?) = supp(^). Hence we can assume 
that there are disjoint sets A\ % — 1, . . . , n such that <p = £\ Oil^ and ip = frjly^ where 
each di > and bi > 0. Let so = s(tp + ip,t) and define ctij, /3, and ji by setting 1/otj = 
f A H(e s ^), 1/A = / Aj H(e s ^) and 1/ Ti = H(e s °^ +b >). Clearly < 7l < A < 00. 
Denote (a, + fe^l^ by #3, and note that s{gi,ji) = so- Hence 



But now by (ii) 



i i 
i i 



completing the proof. 



Proof of (vi)[ It suffices to show that s((p,t) > s(ip,t) for all t, because then the 
argument given in the proof of part (i) will give the result. Therefore it also suffices to 
show that 

H{e s ^ x) )dx < [ H{e s ^)dx (13) 

JR d 

for all s > 0. It is straightforward to check that F(y) := j- (H(e sy )/y) > for all y > 0. 
But 



H(e sy ) = y / F(z)dz 



F(z)yl y > z dz, 



and so 



OO POO 



H{e s ^)dx = / / F{z)ip{x)l v{x) > z dzdx. 
Jo Jo 
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We may swap the order of integration since all the quantities involved are non-negative. 
Hence, using the fact that F(z) > 



x 



< 



F(z) ij (p(x)l^ x )> z dxj dz 
F(z) (J i/;(x)l^ x) > z dx\ dz 
H(e s ^ x) )dx, 



so that (fTBl holds, as desired. 



Proof of (iv) : Note that the substitution y = Xx gives that: 

- = J H(e s ^ Xx) )dx = X~ d [ H 

t JR d JR d 



stp(y) 



)dy, 



so that s(<p\,t) = s(<f, X d t). Using the same substitution we get 

t{<p x , t) = X~ d [ V (y)e s ^ x ' dt ^dy = A" d t). 



The upper bound now follows from the fact that A d > 1 and that s(tp,i) is decreasing 



in t. The lower bound follows from part (vi) because J <^Al{^ A >a} = A d J fl{ v > a } for 
all a (again by the substitution y = Xx). ■ 



Proof of (v) : Let A = (^f^) d ■ Then < A < 1 and so by (iv) and its proof 



tin t) < t(<p x , t) = x- d a<p, = t+h). 

Also, we have already seen that t + h) < £((p, t). ■ 

We have now completed the proof of Lemma 12.11 

3 Proofs for generalised scan statistics 

In this section, after some preliminary results, we consider generalised scan statistics first 
in the sparse case, then the dense case, and finally prove Theorem 11.81 on the limiting 
behaviour of M, 



Here is a rough sketch of the main idea of the proof of Theorem ll.8l when anr ~ t Inn. 
Consider first the special case if = lw Since there are about r~ d disjoint scaled translates 
rW of W where the probability density is close to a, we see that My? behaves like the 



maximum of about r~ 



n 



independent copies of the number Z of points X%, . . . , X n 
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in a fixed scaled translate rW where the probability density is close to a; and Z is roughly 
Po(A) where A = vol(W)anr d ~ vol(W)t Inn. Large deviation estimates show that My/ 
will be about cA where c > 1 satisfies P(Po(A) > cA) ~ and this happens when 

H(c)X ~ Inn, that is #(c)vol(W)t ~ 1. 

For the general case concerning M^, it suffices to consider a step function <p = £\ a^l^ 
where the sets Aj are disjoint. If Zi corresponds to rAi just as Z corresponded to rW 
above, then M v behaves like the maximum of about r~ d independent copies of ^cijZj, 
and we may proceed as above. 

3.1 Preliminaries 

We need results on the maximum density a and disjoint 'dense' sets. The first lemma is 
from Miiller [15] . and is a straightforward consequence of the fact that the set of points 
where v has density at least (1 — s/2)a has positive measure. 

Lemma 3.1 Let W C M. d be bounded with positive Lebesgue measure and fix e > 0. 
Then there exist Q(r~ d ) disjoint translates x\ + rW, . . . ,x^ + rW of rW with v[x,i + 
rW) I vo\(rW) > (1 - e)a for all i = 1, . . . , N. 

This last result extends to: 

Lemma 3.2 Fix e > and let W C M d be bounded and let Wi, . . . , Wk be a partition of 
W with vo\{Wi) > for all i. Then there exist Q(r~ d ) points x%, . . . , x^ such that the sets 
Xi + rWj are pairwise disjoint and v{xi + rWj)/ vo\(rWj) > (1 — e)o~ for all i = 1, . . . , N 
and j = 1 , . . . , k . 

Proof: Set pi := and p := mhijpj. By Lemma EH] there exist points Xi,...,Xn 

with N = Q(r~ d ) such that the sets Xi + rW are disjoint and satisfy v[x,i + rW) > 
(1 — pe)avol(rW). By construction the sets Xi + rWj are disjoint. We now observe that 
v{xi + rWj) must be > (1 — e)a vol(rH^), because otherwise 

v[xi + rW) < (1 - pj)a yo\(rW) + (1 - e)pja vo\(rW) 
= (1 — PjE)o vol(xj + rW) < v{x,i + rW), 

a contradiction. ■ 

For the proofs in this section we will also need some bounds on the binomial, Poisson and 
multinomial distributions. The following lemma is one of the so-called Chernoff-Hoeffding 
bounds. A proof can be found for example in Penrose [T6j . 

Lemma 3.3 Let Z be either binomial or Poisson with fi := KZ > 0. 

(i) If k > fi then F(Z > k) < e~^ H{ ^ . 

(ii) If k < fi then P(Z < k) < e~^ H( ^ . 
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Often the upper bound given by Lemma 13.31 is quite close to the truth. The following 
lemma gives a lower bound on P(Po(/i) > k) which is sufficiently sharp for our purposes 
(see Penrose [16] for a proof). 

Lemma 3.4 For k, /i > it holds that P(Po(/i) = k) > ^Jg-e"^^. 

A direct corollary of lemmas 13.31 and [3.41 is the following result: 

Lemma 3.5 For a > 1 it holds that P(Po(//) > a/S) = e -^(«)+o(M) as ft ^ oo. 

Another bound on the binomial and Poisson that will be useful in the sequel is the following 
standard elementary result (see for example McDiarmid |12j). 

Lemma 3.6 Let Z be either binomial or Poisson and k > fi := KZ . Then 

rJL)* < wz >k)< (^) k . 
ek k 

We will also need the following result from Mallows [10] on the multinomial distribution: 
Lemma 3.7 Let (Zi, . . . , Z m ) ~ mult(n;pi, . . . ,p m ). Then 

p(Zi <k 1 ,...,z m <k m )< n™ x p(z 4 < h). 

3.2 Sparse case 

Here we consider the behaviour of Mw in the 'very sparse' case, and then the 'quite sparse' 
case. For suitable sets W we find results on Mw that do not depend on W. 

First we introduce a convenient piece of notation. If A is an event then we say that 
A holds almost surely (a.s.) if F(A) = 1, and if Ai,A 2 , ... is a sequence of events then 
{A n almost always } denotes the event that "all but finitely many A n hold". We will 
frequently deal with the situation in which F(A n almost always ) = 1, which we shall 
denote by A n a. a. a.s. (A n almost always almost surely). We hope this is a convenient 
shorthand, which avoids clashes with the many different existing notations for F(A n ) = 
1 + o(l) (a.a., a.a.s., whp.) that are in use in the random graphs literature. The reader 
should observe that A n cl.cl.cL.S. IS cL much stronger statement than A n a.a.s. Observe that 
the conclusion of Proposition 11.61 is that in the case considered there we have x{G n ) = 
oo(G n ) a. a. a.s. We will need the following lemma in order to prove Proposition 11.61 

Lemma 3.8 Let W C M d be a measurable, bounded set with nonempty interior, and fix 

keN. 

(i) If nr d < n~ a with a > \ then M w < k a. a. a.s.; 

(ii) If nr d > with < -r—r then Mw > k a. a. a.s. 
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Proof of part (i) : We first remark that it suffices to prove the result for W a ball. This 
is because W is bounded and hence we must have W C B(0; R) for some R > (and hence 
also My/ < Mb(o-,r), so that Mb(o,r) < k implies Mw < k). Furthermore, since W is a ball 
it is clear that Mw is non- decreasing in r and we may assume without loss of generality 
that r is chosen such that nr d = n~ a . If some translate of rW contains k + 1 points, then 
some Xi has at least k other points at distance < 2Rr. Hence 

F(M W > k + 1) < P(3z : yf(B(Xi] 2Rr) > k + 1) < n¥('N(B(X 1 ; 2Rr)) >k + l). 

Note that 

F(N(B(X 1 ;2Rr)) >k + l) < F(Bi{n,avol{B)2 d R d r d ) > k) < ( g™iggfgzgf )fc 

= 0(n~ ka ), 

where we have used Lemma [3.61 As a > t, we have a' := ka — 1 > 0. We find 

F(M w >k + l)=0(n~ a '). 

Unfortunately this expression is not necessarily summable in n so we cannot apply the 
Borel-Cantelli lemma directly. However, setting K := [A] + 1, we may conclude that 

P(Miy(r?r Rr , r(m K )) < k for all but finitely many m) = 1, 

because now claim that from this it can be deduced that Mw < k 

a.a.a.s. Note that 

r((m-l) K ) 



lim 



r [m 



because nr d = n a . Consequently 7 := sup m r ^™ m K) - < 00 ■ By the previous we may also 
conclude that 

P(M 7 n/(m A ', r(m K )) < k for all but finitely many m) = 1. 

Let n, m be such that (m — 1) < n < m x . Note that for any x G M d it holds that 
x + r(n)W C x + ^r(m K )W as r )r{m K ) > r((m — 1) K ) > r(n). In other words if M^(ri) > 
+ 1 then also M 7 M/(^ i<: ) > k + 1. Thus it follows that 

F(Mw{n, r(n)) < k for all but finitely many n) = 1, 

as required. ■ 

Proof of part We may assume that (3 > 0. Also, we may again assume that 

W is a ball. This is because W has non-empty interior and it must therefore contain 
some ball B, so that it suffices to show Mb > k a.a.a.s. Again, by the fact that Mw is 
non-decreasing in r (when W is a ball) we may assume wlog that r is chosen such that 
nr d = n~P. By Lemma 13.11 we can find disjoint translates W\, . . . , Wn of rW satisfying 
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v(Wi) > (1 — s)a vol(W)r d where N = Q(r~ d ). Now notice that the joint distribution of 
(N(Wi), . . . , 3\f(Wjv), N(M d \ UjWj)) is multinomial, so that we can apply Lemma [3TT1 to see 
that 

P(Mvk < k - 1) < P(K(Wi) < fc - 1, ... , N(Wjv) < Jfe - 1) < n? =1 V(N(Wi) <k-l). 
The (marginal) distribution of N(Wj) is Bi(n, z/(Wi)), so that Lemma [3.61 tells us that 
P(N(Wi) > fc) > ( ^(l-^Wr" )fc = cn _ fc , 



Thus 



F(M W < fc - 1) < (1 - cn~ kp ) N < exp[-cn- k/3 N]. 

As nr d = n~P we have r~ d = n 1+/3 . As < tzt we also have /?' := 1 + (3 — kf3 > 0, so that 
n-^N = n(nP'). Thus 

F(M W <k — l)< exp[-fi(r/)], 

which is summable in n. It follows from the Borel-Cantelli lemma that Mjy > k a.a.a.s. 
as required. ■ 

We now move on to consider the 'quite sparse' case. 

Lemma 3.9 Let W C ~R d be a measurable, bounded set with non-empty interior and fix 
< e < 1. Then there exists a (3 = f3(W, a, e) > such that ifn -13 < nr d < /31nn then 



[1 — e)k{n) < M w < (1 + e)k(n) a.a.a.s., 



with k{n) = lnn/ln(^ 



Proof: As in the proof of the previous lemma we may again assume that W is a ball. 
Set k{n) := lnn/ln(j^§). Let us first consider the lower bound. Completely analogously 
to the proof of Lemma [3.81 part (ii) we have 



F(M W < (1 - e)k) < (1 



i(n,Cr d ) > (1 - e)k)) n{r ' d) 



* ^W(^) (1 - e)l >] 

= exp[-fi(r- d exp[-(l - e)k(\n(^) + D) 



with C := (1 - e)avol(W), D := ln(^%^). By choice of k: 



k{\n{ 



nr u 



+ D) 



In n 

In n > 

nr d > 



ln( 
Inn 



. .Inn, . ,lnn., 

H— T )-HH— i)) + D 



nr u 



nr u 



, Tnn.. . , Jnn N „ . , ,lnn, 
l-ln(ln(— ))/ln(— ) + D/\n(- 



nr u 



nr u 



nr u 



(14) 



(15) 
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If n & < nr d < (3\nn then ^ > i. Thus, if j3 > is chosen small enough then: 



P 

k ( D — ln(ln(^))\ 

fcflnf— j + D) = 1 + , \ wr Inn < Inn. 16 

1 K nr*> } \ lu(ip) y K } 

Also note that r" d > = n 1+o(1) . Combining this with flUD and (TTB|) . we get 

P(M W < (1 - e)Jfe) < exp^^rt^ 1 "^ 111 ™)] = exp[-fi(r- d n- 1+£+ °W)] 
< exp[-n £+ °W]. 

This last expression sums in n, so we may conclude that My/ > (1 — e)k a.a.a.s. if 
n~P < nr d < j3Yan for (3 > sufficiently small. 



Let us now shift attention to the upper bound. As in the proof of part (i) of Lemma [3T81 
the obvious upper bound on F(M W > (1 + e)k) does not sum in n. Unfortunately the trick 
we applied there does not seem to work here and we are forced to use a more elaborate 
method. For s > let us set 

In Ti 

M(n, s) := max 3\f(x + sW), k(n, s) := Inn/ ln( — -). 

xeR d ns d 

Note that k(n,s) is non- decreasing in n and s and so is M(n,s) (because W is a ball). 
The rough idea for the rest of the proof is as follows. We fix a (large) constant K. Given 
n we appoximate n by m K , chosen to satisfy (m — 1) A < n < m K , and we approximate r 
by s > r, which is one of O(lnm) candidate values s±, . . . , sjsrt m ), in such a way that 

(a) M(m K , s) < (1 + l)k{m K , s) a.a.a.s. 

(b) {I + f)k{m K ,S) < (l + e)k(n,r); 

Note that M(m K ,s) > M(n,r(n)) because m K > n,s > r and W is a ball, and that 
combining this with items (a) and (b) will indeed show that M(n, r) < (1 + e)k a.a.a.s. 
The reason we have chosen this setup is that if the constant K is chosen sufficiently large 
we will be able to use the Borel-Cantelli lemma to establish (a), making use of the fact 
that we are only considering a subsequence of N and s is one of O(lnm) candidate values. 

Let us pick s\{n) < S2(n) < . . . such that k(n,Si(n)) = i. Let us denote by A{n) the 
event 

A(n) := {M(n, s;(n)) > (1 + -)i for some 1 < i < I{ri)}, 

with I(n) := lnn/ln(^), the value of k(n, s) corresponding to ns d = 2/3 Inn, where j3 = 
(3(e) is to be determined later (note that k(n,s) = i implies that ln(^) = ^). By 



computations done in the proof of (i) of Lemma [3.81 we know that 

(C 1 d \ (1+f ) fc ( n i s ) 
J^L- ) = ne" (1+ i )KMM*&)+D) (17) 

k(n,s)J 
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for appropriately chosen constants C,D. We may assume wlog that D < 0. By ffT5]) we 
have that 

'H",s)\ , m lnln(|g) , D . 



V A V ns J ln(^) 



ln(^) 

If si < s < then lnn/(ns d ) > Hence, by taking /3 sufficiently small we can 
guarantee that for si < s < sm: 

since Z) < 0. By (JTTJ) we have that for Si(n) < s < S|/( n )j(n): 

P(M(n,a) > (1 + -)k(n,s)) < ne -(i+f)KM(M^)+o) < n -,/3+ (i)_ 
It also follows that 

P(A(n)) < J(n)n- £/3+o(1) = n- £/3+o(1) . 

This last expression does not necessarily sum in n, but if we take K such that Kef 3 > 1 
then we can apply Borel-Cantelli to deduce that (a) holds; that is 

¥(A(m K ) holds for at most finitely many m) = 1. 

Now let n e N be arbitrary and let the integer m = m{n) be such that (m — l) K < n < m K . 
Let i = i(n) be such that Si(m K ) < r(n) < Si + \(m K ). We first remark that if rT^ < nr d < 
/31nn then [m K )~ 13 < m K r d < (-^) K ^ ln(m x ), giving: 

v ' < k{m K ,r) < -(1 + o(l))- V 



/3 " v ' y - v ln(/3) • 

So for n sufficiently large, we must have < i < I{m K ). 

To complete the proof we now aim to show that (for n large enough), If M(n,r(n)) > 
(1 + e)k(n, r(n)) then 

M(m K , s i+1 (m K )) > (1 + £ -)k{m K , s i+1 (m K )) = (1 + + 1). 

It suffices to show that (l+e)/c(n, r(n)) > (l+|)(i+l), because M(n, r(n)) < M{m K , Si + \{m h 
(since W is a ball). Routine calculations show that 

k(n, r{n)) > k((m - 1) K , Si{m K )) ~ k{m K , Si{m K )) = i; 

and, assuming that /3 < e/6, for n sufficiently large 



(1 + e)k(n, r(n)) > (1 + | )(1 + 20)i > (1 + |)(z + 1) 



as required. 



Lemma 13.91 also allows us to deduce immediately the following corollary, which extends 
lemma 5.3 of the first author [12] and may be of independent interest. 
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Lemma 3.10 Let W C M. d be be a measurable, bounded set with non-empty interior. If 
for each fixed e > we have n~ e < nr d < e In n for all sufficiently large n, then 

In n 

Mi4/~lnn/ln( — -) a.s., 

nr d 

3.3 Dense case 

The 'dense' case is when anr d / Inn is large. Let us sketch our approach to proving this 
result. We first show that it suffices to consider a simple function </?. Also we "Poissonise" ; 
that is, we consider random points X±, . . . ,Xn where N has an appropriate Poisson dis- 
tribution. To prove the lower bound, we use Lemma 13.21 to see that there are points Xi so 
that we can translate to regions with density near a; then we lower bound M v by a linear 
combination of independent Poisson random variables (indeed by independent copies of 
such linear combinations), and use Lemma 13.31 to complete the proof. The proof of the 
upper bound is more involved. We first find a suitable simple function <p v > tp and close to 
tp. Consider a cell C = [-R/2, R/2) d containing the support of tp v , and partition M d into 
the sets T(x) = x + rKL d of translates of the points x G rC. Let the random variable U 
be uniformly distributed over rC and let M(U) be 

N x _ 

max y^^ni— )• 

Because of the way ip v was chosen, it suffices to upper bound F(M(U) > (1 + e)k). To do 
this we show that for each possible u 

F(M(u) > (1 + e)k) = 0(r- d ) ■ ¥(Z > (1 + e)k) 

where Z is a linear combination of independent Poisson random variables, and finally we 
use Lemma [3.31 to complete the proof. 

Lemma 3.11 Let if be a tidy function. For every e > there exists a T = T(<p,e) such 
that if o~nr d > T In n then 

(1 - e)k < M v < (1 + e)k a. a. a.s., 

where k = o~nr d J (p. 

Proof: Let us first observe that it suffices to prove the result for (p a simple function, 
because the functions tp we are considering can be well approximated by the functions 

</^ wer , <^P per defined by: 

[m-max ip] [m-max ip~\ 

, lower . { \1 ,„uppcr . / * \i 

k=l k=l 
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Here we mean by "well approximated" that y>^ wer — V 9 ^ V 9 ™ per f° r m an d 

lim / = lim / vT° r = [<P- (18) 



m— >oo 



Observe that ( 1T81) follows from the dominated convergence theorem (since ip is bounded 
and has bounded support). Also observe that the sets { ( ^ pcr > a} = {(f > ^^-} and 

{</?j7 er > a} = {ip > have a small neighbourhood for all a. 

Clearly M^wr < M v < Mapper. Thus the result for non-simple functions will follow 
from the result for simple functions by taking m such that J tp^ pcr — J <^ wcr < 1 1 ¥ an d 
setting T := max(T!,T 2 ), where 7\ := T(ip^ pcr , |) is the value we get from the result for 
simple functions applied to p^ pcr with | and T 2 := T(</^ wer , |) is the value we get from 
the result for simple functions applied to <p l ™ ei with |. 

In the remainder of the proof we will always assume that ip = Y^iiLi a i^-Ai is a simple 
function with the sets Ai disjoint and bounded and that {</? > a} has a small neighbourhood 
for all a. Let us set 

k = k{n) := anr d J p. (19) 

It remains to show that (1 — e)k < M v < (1 + e)k a.a.a.s., whenever anr d > Thin for 
some sufficiently large T. 

Proof of lower bound: Let N ~ Po((l — Yoo) n ) ^ e independent from Xi, X 2 , .... It will 
be useful to consider X%, . . . ,X N (rather than X±, . . . ,X n ), because they constitute the 
points of a Poisson process with intensity function (1 — -^)nf (where / is the probability 
density function of u), see for example Kingman [8]. 

By Lemma 13.21 there are Q(r~ d ) points x±,...,xk such that v{xi + rAj) > (1 — 
3^5)0" vo\(Aj)r d for all 1 < i < K, 1 < j < m and the sets + rAj are disjoint. For 
the current proof we will only need that K > 1, but the fact that K = Q(r~ d ) will be 
needed for the proof of Theorem 11.81 which proceeds along similar lines to the current 
proof. Let us set Mj := J2jLi { Pi~ J ~f~ L )} so that 

Mi = a^ N (xi + rAi) H h a m K N {xi + rA m ), 

where Njv(-B) = ^2 i=1 lx^es denotes the number of points of the Poisson process in B. 
Note that N/v(xj + rAj) is a Poisson random variable with mean at least 

£ N2. 



Setting 



^' :=(1 ~ iW Vvol( ^ )nr • 



N X -T 

K := -pE^: 



a-eyiv- i=1 

we have 



P(M' < (1 - e)k) < P(Mi < (1 - e)k, ...,M K <(1- e)k) = n^ 1 P(M i < (1 - e)k) 
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where in the last equality we have used that distinct Mi depend on the points of a Poisson 
process in disjoint areas of M. d and hence the Mj are independent. If Z = a\Z\ + • • • + 
a m Z m with the Zj independent Poisson random variables satisfying KZj = fij then 
stochastically dominates Z, so that: 

P(M V < (1 - e)k) < F(M' V < (1 - e)k) + P(JV > n) < P(Z < (1 - e)k) K + ¥(N > n), 

and consequently, by Lemma 13.31 

P(M„ < (1 - e)k) < P(Z < (1 - e)A;) x + e" ar \ (20) 

where a := (1 — ^)//"( 1 E ). But K = f2(r _d ) is > 1 for n sufficiently large, and then 

iuu i 100 

P(M V < (1 - e)k) < P(Z < (1 - e)k) + e" Qn . Further, using Lemma 



P(Z<(l-e)A;) < E"i p (^< 7T3^w^ 

\ 100' 

< m ■ maxi P(Po(/jj) < ,,}ZL\2 ^ 

V 1 100 ' 

< m ■ expl — min^ u^ff ( j 1- / 

V"- 1 100 



100 



Now suppose that T has been chosen in such a way that (and we may suppose this) 

It follows that E n P(M v5 < (1 — s)k) < rnJ2 n n~ 2 + J2 n e~ an < oo, which concludes the 
proof of the lower bound. 

Proof of upper bound: We may assume wlog that a± > a-i > ■ ■ ■ > a m > 0. Recall that 
A v denotes A + 5(0; rf) = U a£j 4-E>(a; rf). For 77 > let <f v be defined by 



<p v {x) 



at iix e{A l ) v \\J j<i {A j ) v , 
if x (Ai) v for all 1 < i < m. 



and let r\ be chosen such that J (p v < (1 + joo) / V 9 - This can be done, since <p is tidy and 
so the sets A{ have small neighbourhoods. Thus we can choose 77 so that 

vol((A)„ \ LMW < (i + 155) vo1 (A) (21) 

for all i, and then we also have f <p v = Y.i a i vo K( A i)v\Uj<i( A j)v) ^ ( 1+ Too) Si a ' vol (^i) = 
(1 + /<£>. Clearly ip n {x) > <p(x) for all x giving > M v . 

Similarly to what we did for the lower bound, let N ~ Po((l + y§q)^) be independent 
of the Xi and set 

N X -x 
M^max^H ). 

3=1 
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We have 



P(M V > (1 + e)k) < P(M; > (1 + e)k) + W(N < n) < ¥{M' V > (1 + e)k) + e~ an , (22) 

for some a > (where we have used the Lemma 13. 3j) . Again the points X\, . . . , are 
the points of a Poisson process, this time with intensity function (1 + -^)nf . 

Let R > be a fixed constant such that the support of (p v is contained in [-^, 
(R exists because we assumed the A, are bounded). Let U be uniform on [0,rR) d and let 
T(U) be the random set of points U + rRZ d (= {U + rRz : 2 G Z d }). For z G M d let M x 
be the random variable given by Y^jLi fvi^ 2 ^)- us define 

M(U) := max M z . 
zer(u) 

If ||j9 — q\\ < 7]r then tp n {^—^) > V 9 (^~ :2 ) f° r & U x by definition of t^. For any q G M d , the 
probability that some point of T(U) lies in B(q;rjr) equals 

¥(T(U) n B(q; V r) ^ 0) = vol ( 5 ^ 



(We may assume wlog that R is much larger than tj.) Because X^j=i ^(^7 — ) — Sj=i J r ^ > 
whenever ||x — y|| < 777", this gives the following inequality: 

F(M(U) > (1 + e)k\M' v > (1 + e )fc) > vol g)^ . 

We find: ^ 

P(M; > (1 + e )*) < ^g P(M([/) > (1 + e)k). (23) 

Let us now bound F(M(U) > (1 + £)&). To do this we will condition on C/ = u and give a 
uniform bound on P(M(w) > (l + e)k). The random variables M z , z G T(u) can be written 
as a\M Zt i + • • • + a m M z>m with the M Zyi independent Poisson variables with means 

EM Z)i < (1 + ^) 2 vo l(A i )an/ =: ^. 

Let us partition Y{u) into subsets r 1; . . . , Y K with if = 0(r~ d ) such that 

EM z,i < Ik for all z G { 1 , . . . , m}. (24) 

To see that this can be done, notice we can inductively choose maximal subsets Tj C 
T(n) \ Ui'<i Fj> with the property ^] 2gr ,EM v < fa for all z G {l,...,m} (where by 
maximal we mean that the addition to Tj of any z G" Uj'<j ^-Y would violate this last 
property). With the Tj chosen in this way, we must have that Tj U {z} violates one of the 
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constraints (124)) for any z G Tj+i. Thus, in particular Y^iLx X^er.,-ur i+1 ^^,t > rnhij/ij if 
r j+1 7^ 0. Consequently, if we were able to select K subsets Tj we must have 

K m 



K — 1 

L— — J min^ < J2 E E EM *>' ^ ( X 



5 ^ 

n, 



100' 

where the second inequality follows because the correspond to the number of points of 
a Poisson process of total intensity (1 + -^)n in disjoint regions of M d . So we must indeed 
have K = 0(r~ d ), and that the process of selecting Tj must have stopped after 0(r~ d ) 
many Tj were selected. 

Set M Vj := J2 z &t, M z- A s T(u) = \J. Tj we have 

M(u) = max M z < maxM r .. 

z£T(u) j 3 

Note the My are stochastically dominated by Z = a\Z\ + ■ ■ ■ + a m Z m , where the Zj are 
independent with Zi ~ Po(/ii). Thus 

¥(M(u) > (1 + e)fc) < KF(Z > (1 + e)fc). 

Because this bound does not depend on the choice of u we can also conclude 

¥(M(U) > (1 + e)k) < KF(Z > (1 + e)k). (25) 

We then have: 



P(Z >(! + £)*) = P(E «^ > (TT^F ^ ^ ^r=i > (irft 



< m • exp [- min, /Ujif ( ( i+^)2 



using Lemma 13.31 Now suppose that T has been chosen in such a way that (and we may 
suppose this): 



r . (1 + _ rm , nvol( ^. „(__)> 3, 



so that 



expf-min/iitf ( q - _^ 2 )] < ri 



-3 



whenever anr d > Thin. Because K = 0(r d ) and anr d > Thin, we have that K = 0{n). 
By pi we then also have ¥(M(U) > (1 + e)k) = 0{n~ 2 ). Combining this with Q22J) 
and fl23|) we find 

P(M^> = 0(n~ 2 ). 

The Borel-Cantelli lemma now gives the result. ■ 
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3.4 Proof of Theorem CCS on M v 

Our next target will be to prove Theorem II .81 on the generalised scan statistic M v . We will 
do this along the lines of the proof of Lemma 13.111 We will however need a straightforward 
generalisation of a Chernoff bound to weighted sums of Poisson variables, which is given 
by the following lemma. 

Lemma 3.12 Let X\, . . . ,X m be independent Poisson variables with X{ ~ Po(Aj/i) where 
Xi > is fixed, and set Z := a\Xi + • • ■ + a m X m with ax, . . . , a m > fixed. Then for each 
fixed s > 0, as fi — > oo 

Proof: The moment generating function of Z (evaluated at s) is 

Ee sZ = UiEe aisXi = exp[^Ai/i(e a ' s - 1)]. 

i 

Hence Markov's inequality gives 

P(Z > p^XiCae* 3 ) = F(e sZ > e ^ Xa ^ s ) 
i 

i i 

= exp[-/i H^se^ - e a * s + 1)] 

i 
i 

On the other hand, 

P(Z > fi ^ie aiS ) > P(Xi > /iA ie ° ls , . . . , X m > /iA r 



= exp[-/i^A l i/(e^) + o( / i)], 

i 

using Lemma [3.51 ■ 
Proof of Theorem 11.81 Suppose first that 

->• t G (0, oo) as n ->■ oo. (26) 



anr d 



Inn 

Let us observe that in this case the statement to be proven amounts to 



i^-^fof) a .s. (27) 
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We proceed as in the proof of Lemma 13. Ill Again it suffices to prove Theorem 11.81 for (p 
a simple function, because the functions tp considered can be well approximated by the 
functions </?m Wer , defined in the proof of Lemma 13.111 where this time we mean by 

"well approximated" that 



hm e(^ wer , t) = hm e(^ pper , t) = e^, ^ 



Observe that (1251) follows from part (viii) of Lemma 12.11 (tp is bounded and has bounded 
support). So the result for non-simple functions will follow from the result for simple 
functions by noticing that M^wer < M v < M^p^ for all m and taking m — > oo. 

Now let tp = YlT=i a i^-Ai be a tidy simple function with the sets Ai disjoint, and with 
< / ip < oo. Then s = s(tp,t) > solves J Rd H(e s ^ x) )dx = 1/t. Note that 

f Rd tp(x)e s ^dx = TZi ^ vol(4), 
f Rd H(e s ^)dx = YZi H ( eSai ) vol (A). 

Let us set 

k := £(p>,t)anr d . (29) 

Again it suffices to prove that (1 — e)k < < (1 + e)k a.a.a.s., for any e > 0. 

Proof of lower bound in ( \27\i : We proceed as in the proof of the lower bound in 

Lemma 13. Ill We restate (1201 from there as: 

P(M„ < (1 - e)k) < P(Z < (1 - e)k) K + e~ an , (30) 

where a > is a fixed constant, K = Q(r~ d ), and Z = a\Z\ + • • • + a m Z m with the Z^ 
independent Po(/x,) -random variables, where /ij := (1 — -^) 2 anr d vol(Ai) . We can write 

P(Z < (1 - e)k) = P(Z < ^ a,e sa >,) = P(Z < ]T ^'>,), 

where s' = s'(t, e) solves YlT=i a i eS ' ai vol(A) = Z^Hi Q i eSa ' vol (A)- Note that s' < 

s, and (provided e is small enough) also s' > 0. Lemma [3.121 now gives: 

m 

1-P(Z < (l-e)Jfe) = F(Z > (l-e)fc) = exp[-(l - ^) 2cmr<i E #( e<IiS ') vol(A) + o(l))] 



i=l 



As < s' < s we have that J2?=i H(e a * s ') vol(A) < £™ x #(e a * s ) vol^) = ± Conse- 
quently there is a constant c = c(t, e) > such that 

P(Z > (l-e)fc) = exp[-(l -c+o(l))lnn] = n" 1+c+o(1) . 

It follows that 

F(Z < (1 - £ )fc) A ' < (1 - n -i+°+*(i))* < exp[-J^n- 1+c+o(1) ] < exp[-n c+o(1) ], 
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using that K is at least n 1+ °^ (as K = f2(r~ d ) and r~ d ~ t^— ), we see that the right 
hand side of (130]) sums in n, so that we may conclude that M v > (1 — s)k a.a.a.s. by 
Borel-Cantelli. 

Proof of upper bound in ( I27|) : Let N, M' rj, tp v , M(U) be as in the proof of the upper 
bound in Lemma 13.111 where now r\ > satisfies ^(ip v ,t) < (1 + £/100)£(<£>, t); and recall 
from ((25]) that: 

F(M(U) > (1 + e)k) < KF(Z > (1 + e)k), 

where K = 0{r~ d ) and Z = a\Z\ + • • • + a m Z m with the Zj independent Po(/ij) random 
variables, where fa := (1 + -^) 2 vo\{Ai)anr d . We now have 

P(Z > (1 + e) fc) = P(Z > 1 + _f ^ Oie'-'/Xi) = F(Z > ^ a^'^), 

^ + ioo) i i 

where s' = s'(e, t) is such that YlT=i a i eS ' ai vol(A) = (ijzfcp YmLi ^ sai vol(Aj). Note that 
s' > s, giving J2i H(e s ' ai ) vol(Aj) > H{e SCLi ) vol(Aj) = ~, and consequently 

H(e s ' ai )fa = (1 + y^) 2 ™^ ]T # (e s ' a <) vol(A) = (1 + c + o(l)) Inn, 

for some c = c(e, £) > 0. Since K = 0(r~ d ) < n for n large enough we find that: 

F(M(U) > (l+e)k) < n¥(Z > (1 + e)k) = nexp[-(l + c + o(l)) Inn] = n" c+o(1) . (31) 

Unfortunately this does not necessarily sum in n, so we will have to use a more elaborate 
method than the one used in Lemma[3J~Tl Note that for any < rj < r\ we have, completely 
analogously to (1231 : 

P(K V > (1 + e)k) < voKB *l_ v , )d nM(U) > (1 + (32) 
By (E2l) and J22J) we also have that for all < rj < rj: 

P(M^ ?) , > (1 + e)k) < n~ c+o{1) + e^ Qn = n" c+o{1) . 

Although the right hand side does not necessarily sum in n, it does hold that if L > 
is such that cL > 1 then we can apply the Borel-Cantelli lemma to show that 

F(M V ,(m L ,r(m L )) < (1 + e)k(m L ) for all but finitely many m) = 1. (33) 

We now claim that from this we can conclude that in fact M 9 < (1 + 2e)/c a.a.a.s. To this 
end, let n e N be arbitrary and let m = m(n) be such that (m — 1) L < n < m L . The claim 
follows if we can show that (for n sufficiently large) 

{M Vn ,(m L ,r(m L )) < (l + e)k(m L )} {M v (n) < (1 + 2e)k(n)}. (34) 
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To this end we will first establish that (for n sufficiently large and) for any x, y: 

^(^)<^(^). (35) 
r{n) r{m L ) 

Since the support of if is contained in we are done if > diam([0, ^} d ) =: 7. 

If on the other hand II ^11 < 7 then ||J££--jfr§J| = < |i_lW| 7 = (i) 

11 r(n) 11 — ' 11 r(n) r(m^) 11 1 r(m lj ) * H r(n) H — I r(m lj ) I ' \ ' 

(because nr d ~ thin giving r(n) = (1 + o(l))r(m L )), so that for ra sufficiently large this is 
< rf and thus (135]) holds uniformly for all x, y (for such sufficiently large ra), as required. 
Since we also have k{n) = (l + o(l))k(m L ), equation ( |34l) does indeed hold for n sufficiently 
large, which concludes the proof of ( 1271) under the assumption ( 1261) where t > 0. 

Completing the proof: Now let us drop the assumption (I26p and prove the remaining 
parts of the theorem. Let t(n) := anr d /\nn be as in the statement of Theorem 11.81 Let 
r = liminf n = liminf n t{n) so r > by assumption. Let < e < |. We want to show 

1 _ £<liminf _it_< limsup _it_ <1 + £ a. s . (36) 
n crnr d t,{tp,t(n)) n crnr a t,{(p,t(n)) 



By Lemma [3JJ] and (vii) in Lemma [27T] there is a constant T < oo such that if lim inf n > 



T then ( |36l) holds. To cover the range between r and T we will use the following claim. 
Claim Let < t a < tb < oo be such that ^- < 1 + s. If r(n) is such that t a < tin) < tb for 

t a v y v 7 

ra sufficiently large then 

1 — e < lim inf y < lim sup v < 1 + e a.s. (37) 

n anr d t,(Lf,t(n)) n anr d t,{Lp,t(n)) 

Proof of Claim: Let us set 

n a = n a (n) := [(t a /t(n)) ■ raj, n b = n b {n) := \(t b /t(n)) ■ n\. 
By obvious monotonicities 

M v (n a (n),r(n)) < M v (n t r(n)) < M v (n b (n),r(n)) 



and by part (v) of Lemma 12.11 



Hence 



M v (n,r(n)) > M v (n a (n),r(n)) ^ 



an{r{n)) d £{(p, t(n)) an(r(n)) d £((p, t a ) 
and, since n a (n) ~ (t a /t(n)) ■ n, we also have 

M v (na(n),r(n)) M v (n a (n),r(n)) t a 



an(r(n)) d £(tp,t a ) an a (n)(r(n)) d £(<p,t a ) t(n)' 
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(39) 



Observe that an a {n){r{n)) d ~ a{t a /t{n))n{r(n)) d = t a \nn ~ t a \ian a (n). By the already 
proved special case of the result under the assumption (126|) applied to M(n a (n), r(n)), we 
therefore have 

M,(n a (n h r(n)) ^ ^ (4Q) 



an a (n)(r(n)) d £(ip,t a ) 

To be more explicit, we use the special case twice. Since e < \ and so > |, for each 
positive integer m the set {n : n a (n) = m} has size 1 or 2. We apply the special case once 
to M^m^r^m)) and once to M^{m,ru{m)) where r^m) := min{r(n) : n a (n) = m} and 
ru(m) := max{r(n) : n a (n) = m}; and finally we note that M^ria^)^^)) must be one 
of M^m^r^m)) or M (/P (m,r(7(m)) for each n where m = n a . 
Hence, combining (|38|) . ( |39|) . f l40|) . we also get 

liminf - — — — > liminf — ^— > — > 1 — e a.s. 

n anr d ^((p,t(n)) n t(n) U 

Completely analogously, lim sup w anr d^ t ^ < f 2 - a -s. This completes the proof of the 
claim 



Now we put the pieces together. Let m and t\ < t 2 < ■ ■ ■ < t m be such that t\ < r, 
t m > T and <l+efor/c = l,...,m — 1. For convenience let us also set t m+1 := oo. 
For each k = 1 m let 



r ( n ) if (^)* <r(n) < (± 
ftjjnny otherwise 

V an I 



r k ln 



Observe that for each n, M v (n,r(n)) equals Mpfo, rk(n)) for some k (that may vary 
with n). Since each M ¥ ,(n, Tfc(n)) separately satisfies fl36|) and the intersection of finitely 
many events of probability one has again probability one, M v (n, r) itself also satisfies ( 136]) . 
The theorem follows. ■ 



4 Proof of parts (iii) and (iv) of Theorem [L2] on uj(G n ) 

In this section we use Theorem 1 1.8 1 on generalised scan statistics (together with Lemma l2TTj) 
to give a quick proof of the following theorem, which is in a general form that is convenient 
for the proof of Theorem 11.41 later on. 



Theorem 4.1 Ift(n) 



Y^- satisfies hminf n t(n) > then 
uj(G n 



anr L 



£(<Po,t(n)) a.s. 
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Since we have already established in Section [2] that / w satisfies the properties claimed by 
part (iii) this implies parts (iii) and (iv) of Theorem 11.21 

Proof: Assume that liminf n £(n) > 0. First set W := -8(0; |). Any set of points contained 



in a translate of rW is a clique of G n , so that by Theorem 11.8 

u(G n ) ^ M w 



lim inf 



> lim 



n->oo anr d ^((po,t(n)) «->oo anr d ^((po,t(n)) 



1 a.s. 



Let us now fix e > and let Ai, . . . , A m C eZ d be all the subsets of eL d that satisfy G 
and diam(Aj) < 1 + 2ep, where p := diam([0, l] d ). Let Wi := conv(v4j). We now claim 
that u(G n ) < maxj M\y % - To see that this holds, suppose X^, . . . , Xi k form a clique in G n . 
Let us set yj := (X*. — X^/r and A := {p G eZ ' : \\p — yi\\ < ep for some 1 < i < k}. 
Observe that = y\ G A and diam(A) < 1 + 2ep, so that A = Ai for some 1 < i < m. 
What is more {y 1: . . . , C W :— conv(A). But this gives {X h , . . . , X ik ] C X^ + rW by 
choice of and the claim follows. 

We will need the Bieberbach inequality, which is sometimes also called the isodiametric 
inequality. (For a proof of this classical result, see for instance Gruber and Wills [6].) 

Lemma 4.2 (Bieberbach inequality) Let A C M. d be measurable and bounded. If A' is 
a ball with diam(A) = diam(A') then vol (A) < vol(A'). ■ 



l+2ep n 

lwi and ip denotes l B r .i± 



By the Bieberbach inequality vol(Wj) < vol(£>(0; 1+ ^ £p )), so that also J <£il{ w >a} 



< 



Also observe that ip(x) 



J ipl{^> a \ for all a where (fii 
yo( 1+ 2 ep )- By parts (vi) and (iv) of Lemma 12. II we therefore have maxj£(<^,t) < £(if),t) < 



2ep) £((fo,t) for any t G (0, oo). Hence for each % we have 



Mi 



U', 



< 



M 



£(</?o,*(n)) £(</?i,t(n)) 



;i + 2ep) £ 



and so by Theorem 11.81 



lim sup 



M 



f(^o,*(rc)) 



< (1 + 2ep) d a.s. 



Now by the claim established above we have lim sup „ ^^"L 
that 



< (1 + 2ep) d a.s. It follows 



anr d £(ip ,t(n)) 



—> 1 a.s. 



which completes the proof. 
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5 Proof of parts (iii) and (iv) of Theorem 11.1 



As we mentioned earlier, in Theorem II. II the same conclusions will hold if we replace x{G) 
by the fractional chromatic number Xf{G) ,and this is the key to the proof. We first give 
two deterministic results on xj an d X f° r a geometric graph. Given a finite set V G R d , 
with say \V\ = n, let us list V arbitrarily as vi,...,v n ; and for r > set G(V, r) as 
the geometric graph G(vx, ■ ■ ■ ,v n ;r). We are not interested here in the vertex labelling. 
We show that for each such set V we have Xf(G(V,l)) = sup„ 6? M(V, </?); and then we 
give an upper bound on x(G(V, 1)) (from rounding up a solution for Xf) °f t ne form 
(1 + e) maxj =lj _ jm M(V, cpi) + c, where the functions <fi are nearly feasible tidy functions. 
After that, we give three technical lemmas, and use Theorem 11.81 to complete the proof. 

5.1 Deterministic results on fractional chromatic number 

In this subsection we give two deterministic results on Xf{G) for geometric graphs G. 
Recall that if if : W 1 — > M is a function and V C R d is a set of points then M(V, if) := 

Lemma 5.1 Let V C M. d be a finite set of points and consider the graph G = G(V, 1). 
Then 

Xf (G)= sup M(V,<p). 

Proof: Recall that Xf{G) is the objective value of the LP relaxation of the integer 
LP (fl2|) . By LP-duality Xf(G) also equals the objective value of the dual LP: 

max l T y 
subject to A T y < 1, 

y>o. 

For convenience let us write V = {v±, . . . ,v n } where V{ is the vertex corresponding to the 
i-th row of A (and thus the i-th column of A T ). Notice that a vector y = (yx, . . . ,y n ) T 
is feasible for the dual LP if and only if it attaches nonnegative weights to the vertices 
of G in such a way that each stable set has total weight at most one. There is a natural 
correspondence between such vectors y and certain feasible functions (hence our choice of 
the name 'feasible function'). 

Let ip be any feasible function and x G lR d an arbitrary point. We claim that the vector 

y = (v?Oi - x), . . . , tp{v n - x)) T 

is a feasible point of the dual LP given above. To see this note that each row of A T is the 
incidence vector of some stable set S of G; and || (z — x) — (z' — x) || = ||z — > 1 for each 
z 7^ z 1 G S since S is stable in G. Hence 



(A T y) s = J2v(z-x)<l, 
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by feasibility of tp. This holds for all rows of A T , so that y is indeed feasible for the dual 
LP as claimed, i 
This shows that 



LP as claimed. Also, notice that the objective function value l T y equals Y^j=i^P{ v 3 — x )- 



Xf(G) > sup sup S2 <p(vj - x) = sup M(V, (p). 

Conversely let the vector y = (yi, . . . , y n ) T be feasible for the dual LP. Define (p(z) = 
Y^7=i Vi^-z=vi- Then tp is clearly a feasible function, and 

n n 

l T y = < sup Y V{ v j ~ x ) = M ( V > V)» 

3=1 xeRd 3=1 

so that X f(G)< sup^M(V, <p). ■ 

We now turn our attention towards deriving an upper bound on the chromatic number, 
and give another deterministic lemma. Given a > we say that the function tp on M. d is 
a-feasible if the function <p a {x) = <p(ax) is feasible (that is, if 5" C M. d satisfies \\s — s'\\ > a 
for all s 7^ s' G S then ^ sg 5V 9 ( s ) ^ !)• Thus 1-feasible means feasible; and if a < (3 and 
<p is a-feasible then <p is /3-feasible. 

Lemma 5.2 For eac/i e > there exists a positive integer m, simple (1 + e) -feasible, tidy 
functions tpi, . . . , (p m , and a constant c such that: 



x(G(V,l))<(l + e) max M(V, (pi) + c, 

i=l,...,m 



/or eac/i finite set V C 



Proof: Let £ > 0, and let A' G N be a (large) integer. Let us again set p := diam([0, 
and set L := |~(1 +ep)/e\. Observe that \\y — z\\ > 1+ep whenever \yi — zi\ > Le for some 
coordinate 1 < i < d. 

We shall show that there exist (1 + 2ep)-feasible tidy functions <p\, . . . ,(px such that 
the following holds for any V C M. d : 

X(G(V,1))< (l + Ay m a xM (^,^) + (2A) 2d (l + ^y. (41) 

This of course yields the lemma, by adjusting e and taking K sufficiently large. 

We partition JBL d into hypercubes of side e. Let Y be the (infinite) graph with vertex set 
eL d and an edge pq when \\p — q\\ < 1 + ep. For each q G eL d let C q denote the hypercube 
q + [0, e) d . Observe that the hypercubes C q for q G eL d partition M. d . Thus for each z G M. d 
we may define p(z) to be the unique q G eL d such that z G C q . 

Now let Vb = [-Ke, Ke) d n eZ^, and note that |V | = (2A) d . For each p G eZ d let 
r p be the subgraph of V induced on the vertex set p + Vq, that is by the vertices of T in 
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p + [—Ke, Ke) d . Observe that the graphs T p are simply translated copies of T°. Let B be 
the vertex-stable set incidence matrix of r°. 

Now let V be an arbitrary finite subset of M. d Given a subset S of M. d , let us use the 
notation N(S) here to denote \S(~) V\. Let Yy be the graph we get by replacing each node 
q of T by a clique of size N(C q ) and adding all the edges between the cliques corresponding 
to q, q' G Vo if qq' G ^(r ). It is easy to see from the definition of the threshold distance in 
T that G(V, 1) is isomorphic to a subgraph of IV . For each p G eZ d let Ty be the subgraph 
of Ty corresponding to the vertices of T p . 

Consider some p G e7L d . Then xi^v) * s ^ ne objective value of the integer LP: 

min l T x 

subject to Bx > If (42) 
x > 0, x integral 

where b p = (N(C p+q )) qe y and the vector x is indexed by the stable sets in r°. Here we 
are using the fact that T p is a copy of T and that the vertex corresponding to q has been 
replaced by a clique of size 'N(C p+q ). By again considering the LP-relaxation and switching 
to the dual we find that XfiXv) equals the objective value of the LP: 

max (b p ) T y 

subject to B T y < 1 (43) 

y>0- 

Notice that the vectors y = {y q ) q& v attach nonnegative weights to the points q of V is 
such a way that if S C Vo corresponds to a stable set in r then the sum of the weights 
T^q^sVi ls & t m °st one. Note the important fact that the feasible region (that is, the set 
of all y that satisfy B T y < 1, y > 0) here does not depend on p or V. 

The vectors y that satisfy B T y < l,y > correspond to 'nearly feasible' functions tp 
in a natural way, as follows. Observe that x G [— Ke, Ke) d if and only if p(x) G Vq. Let 
tp : ]R d — > M. be defined by setting 




y P (x) if x G [-Ke, Ke) d , 
otherwise. 



Note that tp(x) = J2 q ev ^-ci(x)y q . Then for each p G eZ d 

= E„ e v E 9e y lo« (v - p)Vq = E„ e y v(« - P) 
< M(V,tp). 

We claim next that the functions tp thus defined are (1 + 2ep)-feasible; that is they satisfy 
Tlij=i f( z j) — 1 f° T an y z i> • • • ' z k sucn that H^j — zi\\ > 1 + 2e:p for all j 7^ I. To see this, 
pick such z\, . . . ,Zk- Since <p is outside of [—Ke, Ke) d we may as well suppose that all 
the Zj lie inside [—Ke, Ke) d . For i = 1, . . . , k let pi G Vo be the unique point of Vb such 
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that Zi G Pi + [0, e) d . For all pairs i ^ j we have \\pi — pj \\ > \\zi — Zj \\ — ep > 1 + £p. Thus 
Pi, . . . ,pk are distinct and form a stable set S in r°, and therefore correspond to one of 
the rows of B T . The condition B T y < 1 now yields 

<pOi) + • ■ ■ + <p(z k ) = y P1 + --- + y Pk = (B T y) s < 1. 

This shows that <p is (1 + 2ep)-feasible as claimed, and it can be readily seen from the 
definition of <p that it is simple and tidy. 

Recall that a basic feasible solution of an LP with k constraints has at most k nonzero 
elements and that, provided the optimum value is bounded, the optimum value of the 
LP is always attained at a basic feasible solution (see for example Chvatal [2]). Thus, 
noting that by rounding up all the variables in an optimum basic feasible solution x to the 
LP-relaxation of (j4"2"|) we get a feasible solution of the ILP (j4"2"|) itself, we see that: 



X(K) < Xf(T p v ) + (2K) d . 

Now let y 1 , . . . , y m be the vertices of the polytope B T y < 1, y > and let <f±, . . . , <p 
be the corresponding (1 + 2ep)-feasible, tidy functions. As the optimum of the LP 
corresponding to Xfi^v) is attained at one of these vertices we see that: 

X(r^) < max (W) V + (?K) d < max M(V, <pj) + (2K) d . (44) 

j=l,...,m j=l,...,m 

What is more, for each p G eL d we can colour any subgraph of G(V, 1) induced by the 
points in the set 

W p := p + [—Ke, Ke) d + (2K + L)eZ d , 

with this many colours, since by the definition of L = \(1 +ep)/e] , the set W p is the union 
of hypercubes of side 2Ke which are far enough apart for any two points of T in different 
hypercubes not to be joined by an edge. 
Now let the set P be defined by 

P = eZ d n [-Ke, (K + L)e) d = {{ei u ei d ) : -K < ij <K + L}. 

Note that if p runs through the set P then each q G eL d is covered by exactly (2K) d of 
the sets W p . If H p is the graph we get by replacing every vertex q of T that lies in W p by 

a clique of size ^Ky 1 ra ^her than one of size N(C 9 ) and removing any vertex that does 

not lie in W p , then 

X(H P ) < j^maxM(y, Vj ) + (2K) d . 

This is because we can consider the hypercubes of side 2Kb that make up W p separately 
(and each of these corresponds to some Ty) and for each such constituent hypercube 
q + [—Ke, Ke) d all we need to do is replace the 'right hand side' vector b q by ^y b q in 
the LP-relaxation of the ILP (1421) (the rounding up of the variables will then take care of 



34 



2Ks 



Le 



Figure 1: Depiction of a set W p . 



the rounding up in the constraints, as the entries of B are integers). Because each q G eL d 
is covered by exactly N of the sets W p we can combine the (2K + L) d colourings of the 
graphs H p for p G P to get a proper colouring of IV with at most 



(2K + LY 



— d ^M{V^) + {2K) d 



colours and the inequality (HTj) follows 



5.2 Some lemmas on ip G 3~ and 

We now give some lemmas on feasible functions tp G 5" and on the function / x (t) 
sup^ggr t), needed for the proofs in the following subsections. 



voire) 

2 d S 



Lemma 5.3 sup^g^ J Rd ip(x)dx 

Proof: First note that the function <p K which has the value N ^ K ^ on the hypercube 
(0, K) d and elsewhere is feasible, giving that 



sup / ip(x)dx > lim 



N(2K) 



vo\(B) 



by definition of the packing constant 5. On the other hand, let <p be an arbitrary feasible 
function. Let A C (0, K) d with \ A\ = N(2K) be a set of points satisfying \\a — b\\ > 1 for 
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all a 7^ b G A. If 77 is a constant such that \\a — b\\ > 1 whenever |(a)j — (6)j| > 77 for some 
1 < i < d, then the set 

B := A + (K + 7])Z d (={a + (K + r])z:aeA,ze Z d }) 

also satisfies the condition that \\a — b\\ > 1 for all a 7^ b G B. Set t/>(x) := X^gB i pii> + x). 
Since 99 is feasible we must have ip(x) < 1 for all x. For a G A let us denote by B a the 
"coset" a + (if + 7/)Z d C B, and let us set tp a (x) := X]f,e/3 a < / 7 (^ + x )- We have that 

(K + r]) d > [ ip(x)dx = J2 f ip a (x)dx = N{2K) [ cp{x)dx, 

J[0,K+ v ) d aeA J[0,K+ v ) d JR d 

where the last equality follows because 

/ ip a {x)dx — y. / (f(b + x)dx = 2_. / (p(x)dx 

J[o,K+ v )i bGBa J^ K+v) d beBa J b+[0iK+v) d 

and the sets b + [0, K + r]) d with b G B a form a dissection of M. d . Thus we see that indeed 
for any feasible (p 

(K + r]) d vol(B) 

as required. ■ 
From Lemma [5.31 we may conclude: 

Lemma 5.4 Let w = ^if . Then w < sup^g^ £(</?, t) < c(w,t) for all t G (0,oo]. 
Proof: The lower bound follows from Lemma 15.31 and the fact that £((p,t) > J tp (as 



s > 0) for all if. The upper bound follows from Lemma I5~3l together with (J7|) and part (vi) 

of Lemma O (if <p G 3 and W C M d has vol(W) = ^flf 1 then / ^l {(/ ,> a } < / Wl { i ff > fl} 
for all a G R so that f (<p, t) < £(l w , t)). ■ 

Together with observation (jSJ) from section ll.3[ Lemma 15.41 implies: 

Lemma 5.5 \im t ^ ^ f x (t) = ^/p-- ■ 

Moreover, since <po G 3" so that / x (t) = sup^ggr £(<£>, t) > £(ipo,t) = f u (t) = c(w,t) with 
w = v °2<f^ ; observation (jSJ) gives: 



Lemma 5.6 lim^o /* (£) = 00 ■ 

Since each £(y?,t) is non-increasing in i for each 93 separately, we also have: 
Lemma 5.7 f x (t) is non-increasing. 
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Observe that, by part (v) of Lemma [2. 1[ for any h > 0: 

(— — ) sup t) < sup £((/?, t + h) < sup 

Thus: 

Lemma 5.8 f x (t) is continuous in t. ■ 

We shall also need the following two technical lemmas. 

Lemma 5.9 Let 3* be the collection of all p G 7 that are tidy. For each < t < oo 

sup £(<p,t) = sup£0,t) 

Proof: If t = oo then by Lemma [5.31 and the proof of the lower bound in it, both sides 
of the equation above equal ■ Thus we may suppose that < t < oo. Let ip G 3. It 
suffices to show that 

£(p,t)< sup (45) 



We may assume that p has bounded support, because (by Lemma 12.11 part (viii) ) the 
sequence of functions (p n ) n given by p n = iph n>n id satisfies lim n ^oo £,(p n , t) = £(ip,t). Let 
e > and for each q G eL d let C q := q + [0,e) d as before. Define the function p on 
M. d by setting <p(x) = swp y€OP ( X ) <p(y) (where again p(x) is the unique q G eZ d such that 
x G g + [0,e) d ). Clearly (p > p. Although ip is not necessarily feasible, the function ip' given 
by <p'(x) = 0((l+sp)x) is. Also, <p' is tidy: clearly it is measurable, bounded, nonnnegative 
and has bounded support. That the set {p' > a} has a small neighbourhood for all a > 0, 
follows from the fact that it is the union of finitely many hypercubes (1 + ep)~ l C q . So 
ip' G T. We find that 

£{cp, t) < £{<p, t) < (1 + ep) d Z{tp', t) < (1 + ep) d sup t), 



using Lemma [27TI parts (i) and (iv) for the first and second inequalities respectively. Now 
we may send e — > to conclude the proof of inequality (1451) . and thus of the lemma. ■ 



Lemma 5.10 Let < r < oo and let e > 0. Then there exist m and tidy functions 
ipi, . . . , %p m in 5F such that 

max£(^j, t) > (1 — e) sup£(y?, t) for all t G [r, oo]. (46) 

Proof: Recall that each £(<p,t) is non-increasing as a function of t, and thus so is 
f x (t) = sup „ 6 gr£(<£>, t). Also / x (oo) > 0. Let m and r = r < ri < ■ ■ ■ < r m _i < r m = oo 
be such that fx( T i)/fx( T i+i) < 1 + £ f° r eac h i — 0, 1, . . . , m — 1. Let . . . , ^ m be tidy 
functions in J such that £,(i[)i,Ti) > (1 — e)f x (ri) for each i — 1, . . . ,m. If r, < t < r i+1 
then 

£(VW) > e(^+i,r m ) > (1 -e)/ x (r i+ i) > (1 -e) 2 /x(^) > (1 -e)7x(*)- 
and the lemma follows. ■ 
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5.3 Completing the proofs of parts (iii) and (iv) of Theorem 11.1 



Lemmas 15.51 15.61 15.71 and 15.81 show that f x has the properties claimed in part (iii) of 
Theorem 11.11 



We shall now prove the following theorem, which implies parts (iii) and (iv) of Theo- 



rem 11.11 and is also convenient for the proof of Theorems 11.41 and 11.71 later on. 
Theorem 5.11 Ift(n) := satisfies Hminf n t(n) > then 



anr 



f x {t(n)) a.s. 



Proof: It suffices to prove the following. Assume that liminf n t(n) > where t(n) := y^, 
and let e > 0. We shall show that a.s. 



1 — s < lim inf 



Xf(Gn) 



< lim sup 



X(Gn) 



< 1 + e. 



(47) 



n anr d f x (t(n)) „ anr d f x (t(n)) 

For the lower bound in (T4T1) . let ^i, ... , ip m be as in Lemma 15. 101 Then, using Lemma IBTTl 
Theorem 11.81 and Lemma 15.101 



lim inf 



Xf(G r , 



anr d f x (t(n)) 



> lim inf max 



n"~ T cmr^, t(n)) f x (t(n)) 
> lim mt max „ — H > 1— £ a.s. 



f x (t(n)) 

For the upper bound, observe that for ipi, ... , (p m and c as in Lemma [5.21 by rescaling we 
obtain 

X (G n ) = x(G(r^ 1 X 1 ,...,r- 1 X n ;l)) < (l + e)maxM w +c. 



Hence 



lim sup 



X(G n ) 



< (1 + 5) lim sup max 



anr d f x (t(n)) ' n i anr d £((pi,t(n)) 

This completes the proof of (1471) . and hence of the Theorem. 



< 1 + e a.s. 



6 Proof of Theorem 11.3 

Recall that, since ifo = l^o-J.) G ^ is feasible, and using (Q we have 

f x (t) = supf(<p,t) > £(tp Q ,t) = c(w,t) = f u (t), 



where w = vol Sf ■ Thus Lemma [5.41 implies part (i) of Theorem 11.31 



In the remainder of the Section [6] we shall thus assume that 5 < 1. Let us set 
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t :=m£{t>0:f x/u {t)^l}. (48) 

To achieve our target for this section, we still need to show that < t < oo and that 
/x/w(^) ^ s strictly increasing for t > t . We do this in the next subsections. 

6.1 Proof that < to < oo 



Note first that £(y?o, t) > for all t > 0, and by part (v) in Lemma l2TT1 <f (cg n . t) is continuous 



in t. Together with Lemma [5.81 this shows that f x / u is continuous too. 



Since we have already established that lim^oo f x (t) = v °^^ and lim^oo fu(t) - 



2" 



we have lim^oo f x /w(t) = This implies that to < oo and it therefore only remains to 
show that to > 0. 

Let us first give a quick overview of the proof of this section. We first show that 
£(i£( 0; 3),t) < 2£((p ,t) holds for all sufficiently small t > 0. We then prove that if t 
satisfies £(1b(o ; 3), £) < 2^(ip ,t) then f x / u (t) = 1. To do this we assume that f x / u (t) > 1, 
so that there is a feasible function ^ with £(il>,t) > £{<fo,t). We show that there must 
then be such a function which is simple and has support contained in 5(0,2); then we 
use a convexity argument to replace if; by a function which takes values only in {0, |, 1}; 
and then that there must be a function tpp satisfying £((fp,t) > £((po,t) where ipp takes a 
particularly easy form, so that finally we can prove analytically that this cannot happen. 

Now we proceed to fill in the details. 

Lemma 6.1 There is a T > such that 

e(l B (o;3),i) <2£(</M), (49) 

for allO <t<T. 

Proof: We shall prove the stronger statement that £(1,b(o ; 3), £)/£(y?o, t) — > 1 as t j. 0. 
(Observe that this implies the lemma.) 

Pick W G {5(0; 3), 5(0; 1/2)}, and set w = vol(W). Recall from section O that 
£(lw,t) = c(w,t) where c(w,t) > w solves wH(c/w) — 1/t. Clearly 

c(w,t) — )• oo as t 4- 0. (50) 
Writing out the expression for wH(c/w) we find 

wH(c/w) = c(w, t) In c(u>, t) — c(w, t) In it; — c(w, t) + w. (51) 
Thus, combining (15 Op and ( 151]) we see that 
1 



wH(c(w, t)/w) — (1 + o(l))c(u?, t) lnc(it), t) as t 4- 0. 



But this gives 
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aiw, t) = (l + o(l)) (j) I In (jj as £ 1 0. (52) 

Since §2]) is both true for W = 5(0; 3) and W = 5(0; 1/2), we see that £(l B (o i3 ), t)/£(tp , t) -> 
1 as £ 4 0, which concludes the proof. ■ 

Lemma 6.2 If t > satisfies ( fjpp and yHi_ _^ £ as n _!. ^ en a .a.a.s. there exists a 
subgraph H n ofG n induced by the points in some ball of radius 2r snc/i £/ia£ xi^n) — x{H n )- 

Proof: Suppose that £ satisfies (JUD and -» £ as n -> oo. Let = 5(0; 3). By 
Theorem 11.81 and Theorem 14.11 we have 

-r-K(W,£) and -p-t^tpoit) a.s. 

,j nr -<i anr a 

and so < 2u(G n ) a.a.a.s. It is convenient to identify vertex % of G n with Xi. Note 
that if two vertices with 2r < \\X{ —Xj\\ < 3r have degrees > w(G n ) then there must exist 
a translate of rW (centred at \{Xi + Xf)) containing at least 2u(G n ) + 2 points, and so 
the condition Mw < 2u(G n ) fails. Hence a.a.a.s. any two vertices Xi and Xj of G n with 
degrees > ou(G n ) satisfy either \\Xi — Xj\\ < 2r or \\Xi — Xj\\ > 3r. But if we remove from 
G n all vertices which have degree at most u(G n ) — 1 then the chromatic number does not 
change, and by the above we will be left a.a.a.s. with a graph in which each component is 
contained in some ball of radius 2r. This completes the proof. ■ 

We will show that £o > T (with T as in Lemma [6. II) by proving that sup e3 r £) = 
£((po,t) for all £ that satisfy (]49p . The following purely deterministic lemma perhaps 
surprisingly has a convenient probabilistic proof. 

Lemma 6.3 Let t > satisfy (jHO). Then 

sup£(y?,£)= sup £(<p,t). 

supp(v5)CB(0;2) 

Proof: Let r satisfy o~nr d ~ £lnn and consider x(G n ). By Lemma 16.21 a.a.a.s. x{G n ) 
equals the maximum over all x £ M d of the chromatic number of the graph induced by 
the vertices in B(x; 2r). Let us fix an e > 0. Let us denote V := r~ x {X\, . . . , X n } and let 
T, T p , ry, T v be as in the proof of Lemma 15.21 For p £ eL d let A p denote the subgraph of 
T induced by the points of eZ d inside B(p, (2 + ep)), where again p := diam([0, l] d ), and 
let Ay be the corresponding subgraph of T v . Since for every x £ W 1 the subgraph of G n 
induced by the vertices inside B(x, 2r) is a subgraph of some Ay, we have: 

x{G n ) < maxx(Ay) < max M lfi + c a.a.a.s., (53) 

p i=l,...,m 

where tpi, . . . , tp m are obtained from the ILP formulation of x(Ay) via the same procedure 
we used in the proof of Lemma [5T21 (that is, the upper bound in ([331 is the analogue of the 
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upper bound in (1441) ) and c is a constant that depends only on e, d and || . || . By construction 
we have that supp(y?j) C 5(0, 2+2ep) and that p\ given by ip'^x) = <pi((l+2ep)x) is feasible. 
Notice that tp^ also satisfies supp(<^) C 5(0; 2). Thus, (J53l) together with Theorem 15.111 



Theorem 11.81 and part (iv) of Lemma 12. II shows that 



sup < max £(</?*,*) < max (1 + 2ep) d ^' i ,t) < {l + 2ep) d sup £(<p,t). 

i=l, ...,m i=l, ...,m 



supp(i/>)CB(0i2) 



The statement now follows by letting e — > 0. 



Let us now fix a t > that satisfies (|49|) . If sup^^ £(</?, t) > £(<po,i) then there must 
also exist a feasible simple function ip := Y^k=i m^ A k w hh supp(^) C 5(0; 2) such that 



£(ip,i) > £(v?0;£) ; because (by Lemma [2.11 item (viii) ) for any ip the increasing sequence 
of functions (<p n ) n given by tp n = Yl=i ^{^< v< ^} satisfies lim^^ £(p n ,t) = £(<p,t). 



So let ip = Y^iLi ~^Ai be a feasible simple function with £(ip, t) > £,(po, t) and supp(^) C 
5(0; 2). We may suppose wlog that the are disjoint and m is even. For 1 < k < y let 
ipk be the function which is \ on \Si=k and 1 on {J i>m _ k Ai. We can write 



ip 



2 m ' 2 

— y^* 

m z — ' 

k=l 



i— i = — , and if x G A. 

m 2 m> ' 



.,• with 



because for x G Ai with i < m/2 we have — J^^i ipk( x ) 

i<m/2thaniE^^) = l-^i=V 

Let us now observe that £ is convex in its first argument; that is, for any two nonneg- 

ative, bounded, measurable functions a, r and any t > and A G [0, 1] we have 

£(Aa + (1 - A)r, i) < A£(a, t) + (1 - A)£(r, i). 



This follows from parts (ii) and (iii) of Lemma [27T1 Because we have written ip as a convex 



combination of the we must therefore have £(ijj,t) < £(ipk,t) for some k. 

Let us first assume that {ipk = 1} = Ui> m -fc ^ = Since supp(^) C 5(0; 2) we must 
have that ^ < </?', where <p' is the function which is \ on 5(0; 3) and elsewhere, and thus 
also £(ip,t) < £(ipk,t) < £(<£>', t) = |£(ls(o;3),0 (by choice of k and Lemma I2TT1 items (i) 



and (ii)). But then (j49|) gives: 



< -£(l B (0;3),t) <£(^0,t), 

a contradiction. 

So we must have {ipk = 1} ^ 0. Let us denote by C := cl(5) the closed unit ball. Notice 
that 

diam({^ fc = 1}) < 1, 



supp(^ fc ) c p| (x+c), 



x:ip k (x)=l 
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by feasibility of ij) (if x G supp(^fc) and ipk(y) = 1 then ip(x) + ip(y) > — + = 1). 
Bieberbach's inequality (Lemma I4.2[) tells us that vol({^fc = 1}) cannot exceed v °ia , 
the volume of a ball of diameter 1. Hence there is a < /3 < 1 with vol({^& = 1}) = 
vol(5(0; ^^)) = (1 — /3) d ml ^ . We will need another inequality, given by the following 
lemma. 

Lemma 6.4 (K. Boroczky Jr., 2005) Let C C ]R d fre a compact, convex set. Let A C 
]R d fre measurable and let A 1 be a homothet (that is, a scaled copy) of —C with vol (A) = 
vol(A'). Then 



vol m (a + C) j < vol m (a + C) j . 



With the kind permission of K. Boroczky Jr. we present a proof in appendix El because 
such a proof is not readily available elsewhere. 
It follows from Lemma 16.41 that we must have 



vol(supp(^)) < vol ( fl (x + C)\= vol (b{0; i±^)^ 



d vol(B) 



2 d 



For < /3 < 1 let ^ be the function which is 1 on B(0; ±=£) and ± on 5(0; ±±£)\B(0; ^). 
(This agrees with our earlier definition of ip .) We see that vol({^fc = 1}) = vol({<pp = 1}) 
and vol({^ fc = ±}) < vol({ipp = |}). Thus we have / ^fcl{»/'fc>a} < / <Pp^-{<Pp>.a} for all 



a, which gives ^(ipk,t) < £(<fp,t) by part (vi) of Lemma [27T1 We may conclude that if 



sup^ggr £(<£>, t) > £(tpo,t) and (1491) holds then also £(<fip,t) > £(tpo,t) for some < /3 < 1. 
We will show that this last statement is false. Set /i(/3) := £(<pp,t) for < /? < 1. 



Lemma 6.5 



max = max{u(0), u(l)|. 

0</3<l 



Proof: Notice that for < < 1 



M0) = ^^(^((i + ~ (1 - Z 3 ) V /2 + (1 - /^ eS )> 
where s = s(/3) solves 

(((1 + - (1 - /3) d )tf(e*/ 2 ) + (1 - P) d H(e s )) = \. (54) 

The function is continuous on [0, 1]. Differentiating equation fl54l wrt /3 we see that 
for < /3 < 1 

= + Z?)^ 1 + (1 - P) A - x )H{e 8 l 2 ) + ((1 + (3) d - (1 - /3) d ))fe s /V - d(l - (3) d ~ l H(t 
+ (1 - /3) d se s s'. 
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(That s is differentiable wrt /3 can be justified using the implicit function theorem.) This 
gives 

, = d(l - P) d ~ l H(e s ) - d((l + PY" 1 + (1 - f3) d ~ l )H(e s l 2 ) 
S ^ l) ~ ((l + /3) d -(l-/3) d )fe s / 2 + (l-/3) d se s 

Thus, 

^£V(/3) = d({l + /3) d - 1 + {l-/3) d - 1 )le s / 2 -d(l-/3) d - 1 e s 
+ s'(((l + f3) d - (1 - /3) d )±e s / 2 + (1 - (3) d e s ) 
= d{{\ + + (1 - P) d - { )\e s / 2 - d(l - P) d ~ l e s 

+J(tZ(l - /3) d - 1 iY(e s ) - tZ((l + Z?)^ 1 + (1 - py- 1 )!!^ 2 )) 
= i [d((l + + (1 - /3) d - 1 )(e s / 2 - 1) - d(l - f3) d ~\e s - 1)] . 

Clearly //(/?) > for /3 sufficiently close to 1, so that it suffices to show that (for any t) 
//(/?) = for no more than one /3 G (0, 1). Note that = if and only if 

e^-l = ((i±|)^ + l)(e^-l). 

Writing a := + l and x := e s / 2 this translates into the quadratic x 2 — ax + (a— 1) = 

0, which has roots l,a — 1. Now notice that e s ^ 2 = 1 would give s(/3) = 0, but this 
is never a solution of fl54|) . So if //(/?) = for some < /3 < 1 then we must have 
s(/3) = 2{d — l)ln(^4). Notice that, as s cannot equal 0, this also shows that we must 

have d > 2 for //(/3) = to hold. The curve u((3) := 2(d — 1) ln(^l) has derivative 



l-/3)(l + /3) 
On the other hand, for < /3 < 1 

m < d(1 r < " 



l-/3) d se s 1-/3' 

We find s'(/3) < ^ < 4(d - 1)/(1 + f3)(l - f3) = u'(/3) for < f3 < 1 (recall d > 2 by a 
previous remark). We may conclude that the curves u({3) and s(/3) meet in at most one 
point, as u((3) — s((3) is strictly increasing on (0, 1). In other words, there is at most one 
G (0, 1) with //(/?) = and the lemma follows. ■ 



Since we had chosen t so that ( 149]) holds, parts (iv) and (i) of Lemma I2~T1 tell us that: 



Thus 



max n(P) = n(0) = £(<p ,t). 



Observe that we have now achieved our aim in this subsection, namely to show that 
to > 0. Thus we have proved the following lemma. 
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Lemma 6.6 When the packing constant 5 < 1, we have < to < oo. 



6.2 When 5 < 1 the function f x / u (t) is strictly increasing for t > to 

In this subsection we shall prove the result just stated in the heading. The proof uses the 
following lemma. 

Lemma 6.7 For each t > 0, either sup^ e3 -£(<^, t) = ^gj^ or the supremum is attained. 

Proof: Let us assume sup^gr £(</?, t) > v °j^ (as otherwise there is nothing to prove). Let 
us consider a sequence <£> 2 , • • • S 3" such that 

lim £(<^ n , t) = sup £(<p, t), (55) 

and let us suppose (wlog) that 

lim / (p n exists and is as large as possible subject to f|55|) (56) 
n Jb 

(recall that B = 5(0; 1) is the unit ball). We will first exhibit a subsequence <p ni , <p n2 , . . . 
of (tp n ) n and a function ip G 3" such that 

lim sup (p nk (x) < tp(x) for all x G M. d . (57) 

It will need further work to show that the function ip achieves the supremum as required. 

In order to construct ip and the subsequence (<f nk )ki let ^fc be the dissection {i + 
[0, 2~ k ) d : i = (i u ...,id) G 2~ k Z d } of M d into cubes of side 2~ fc (observe that Dfc+i refines 
T>k). For cr G 3" let us define the functions a k by setting: 

a k (x) : = sup a(y), 

where C x ^ is the unique cube CgDj with x G C. Let us now construct a nested sequence 
3i ^ 9^2 ^ ■ • ■ of infinite subsets of <^ 2 , • • • } with the property that 

|cr fc (x) - T k (x)\ < - for all x G [-&, /c) d and all o~,t G 3V (58) 

To see that this can be done, notice that the behaviour of a k on [—k, k) d is determined 
completely by (a k (pi), . . . ,a k (px)) where pi,...,px is some enumeration of [— k, k) d D 
2 _fc Z d . Given 3fc_i there must be intervals Jx, ... ,1k C [0, 1] each of length ~ such that 
the collection {a G 3fc_i : o- k {j>i) G Jj for all 1 < i < iT} is infinite. So we can take 3fc to 
be such an infinite collection. Let us now pick a subsequence cp ni ,if n2 , ... of (<p n )n with 
(p nk G 3& and let the function ^ be defined by: 

ij>{x):= lim ^ Or). (59) 

fc-»oo 
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To see that this limit exists for all x, notice that f l n ,(x) < fnX x ) — fn k ( x ) + \ f° r a ^ 
I > k > IIxIIoq. Thus, 

limsup fc _ >00 <^ fc (:r) < inffc>||x|| 00 ^L<>) + \ 
= liminffc^oo^^x) + \ 
= liminffc^oo^^x). 

We now claim that ip and the sequence (ip required (that is, ip G 5F and (137)1 

holds). To see that (157)) holds, notice that sup J>fe ip ni (x) < <fn k (x) + | for any x and any 
k > Halloo, so that 

lim sup (p nk (x) < lim (f^Ax) = ^( x )- (60) 

k_ >00 k— >oo 

To see that ip G jF, let S = {si,...,s p } G S be finite (observe it suffices to show 
Y2xes' t l ; ( x ) — f° r a ^ finite 'S' G S). Since ||sj — Sj|| > 1 for all z ^ j, there is a k 
such that ||sj — Sj\\ > 1 + 2~ k °p for all z ^ j where p := diam([0, l] d ). Thus if k > ko then 

+ + < 1, 

and hence the same must hold for 

Also notice that the dominated convergence theorem (using ip,<Pn k < 1) gives that 

/ i/j(x)dx = lim / <p\ (x)dx > lim / ip n (x)dx, (61) 
Jb ^°°Jb k n ^°°JB 

using (|60|) for the second equation. Furthermore, for any fixed R > we have that 

lim £«lfl(o ; fl),*) = £W>l S (o,*),*) < £^M)- (62) 



Here we have used parts (i) and (vii) of Lemma I2TT1 Hence, there also is a sequence (Rk) 



with Rk tending to infinity and 

1 ?(0;i? fc ) 



limsup£(¥& Wfc),*) < £foM). (63) 



To see this, notice that by (162)) there exist fci < < . . . such that ^(v* fe lB(o;m),*) < 
£(if),t) + — for all fc > fc m . Thus, we may put Rk '■— max{m : k m < k}. 
Let us put 

fpk,i : = ^n fc lB(0;« fc ), ^fc,o := ^r lfc lEd\ B(0 . jRfe+1) , Ipk ■= 1pk,i + ^k,o- (64) 

We may assume wlog that has been chosen in such a way that £,(ipk,t) = (1 + 
o(l))£(y> nfe , t). To see this note that for s = s(tp nk ,t) there is an ^ < i?' < Rk s.t. 



^n fe e^lB(0;K' + l)\B(0;fl') < / ■ 
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If we take such an R! and set ifj' k := l Pn k ^-R d \B(o;R'+i)uB(o;R') then s(ifj' k ,t) > s(ip nk ,t) (by the 

definition of s, as i/;' k < <p nk ) so that ^' k ,t) > (1 - -^—)^(<f nk ,t). 

L 2 J 

Observe that by our choice of Rk 



lim £(?/> fc ,t) = lim f(<p„ fc ,t) = sup £(</?, t), (65) 

fc— >oo k—>oo ipdJ 

and (since Rk > 1 for sufficiently large) 

lim ip k = lim / y2 nfc = lim / (/?„. (66) 



Let us define A(y?) := sup 5gS ^ xgS <^(^)- Since ipk < y3 n . fe G ^ we have 

Atyfc) = A(^ fc ,i) + A(V*, ) < 1. (67) 

For convenience let us write A^ := X(ipk,o)- First let us suppose that Xk — > as k — > oo. 
Notice that ^^fc,o £ 3 r ) which implies 

£(ii>k,o, t) < A fe sup £(tp, t) = o(l), 



using part (ii) of Lemma [2. 11 Observe that 



by parts (i) and (iii) of Lemma 12711 Thus, £fyk,t) = £(i>k,ht) + o(l). Using (1651) and ((63 



sup£(y?,t) = lim £(ipk,t) < 

so that the lemma follows in the case when A& — > 0. 

In the remaining part of the proof we shall show that in fact we must have A& — > 0. 
Let us assume not, and that lim sup A^ > 0. We may assume for convenience that lim \ k = 
A > (by considering a subsequence if necessary). We next establish the following claim. 

Claim 

vol({^ fc ,o > e}) ->• for all e > 0. (68) 

Proof of (168j) : Let us construct a new sequence of functions ip' k as follows. For each k 
pick an Xk G M, d \ B(0] Rk + 1) that maximises J B ^ Xk .^ 4>k,o- 

To see that such an Xk exists, let us write I(x) := J B i x .^ il>k,o- Notice that / is 
continuous {jpk,o < 1 so that \I(x) — I(y)\ < vol(B(x; 1) \ B(y;l))). Let us suppose 
that c := sup a . 6K d\ B ( ;fl fc +i) H x ) > 0> f° r otherwise there is nothing to prove as any x G 
R d \B(0;R k + l) will do. We first claim that the set {x G R d \B(0; R k + 1) : I(x) > f } can 
be covered by at most [ 2vo ' ( -' B - > j balls of radius two. This is because if I(xi), . . . , I{xk) > § 
there must exist ?/j G B(xi] 1) for 1 < i < k with ipk,o(yi) > 2voi(B) • ^ feasibility of 
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we must have either k < 2 vo '( B ) or \\yi — yj\\ < 1 for some i ^ j. Thus, the yi can be 
covered by at most 2 vo '( B ) balls of radius one, and hence the can be covered by at most 
2vol( -' B - > balls of radius two, as claimed. As / is continuous and we can restrict ourselves to 
a compact subset of lR d \ -6(0; Rk + 1) we see that the supremum c = sup a . gR dv B ( . Bfe+1 ) I(x) 
is indeed attained by some point x k E R d \ B(0; R k + 1). 

Now let ip' k := ipk,i + 4>k,o ° Tk where Tk : y >->■ y + Xk is the translation that sends to 
Xk- By (l67j) we have -0^ e 3^. Notice that 



for all a, because > a} D {ipk,i > a,} U T^T 1 [{^fc.o > a }] so that 

= J^fcl{^ fc >a}- 



Part ( vi) of Lemma 12.11 therefore gives that 

a^t)>aM = (i+o(i))av>n k ,t). 

Thus £(il}' k ,t) — > sup^gj £(</?, t) as A; — > oo, and we saw earlier that ip' k G 9\ We there- 
fore must have fg^.y ^k,o — > 0; for otherwise, using fl66|) . we have lim sup fc J B ip k > 
\im k f B i>k — nm n J B fn, an d (a subsequence of) the ip'k would contradict (I55|) . Now 
suppose that for some £ > we have limsup fc vol({^ i0 > e}) = c > 0. Because 
^fc,o G ^ we can cover {ipk,o > ^} by at most |_-J balls of radius 1. But this gives 
limsup fc § B , Xh . x ^ ipk,o( x )dx > ce and we know this cannot happen. The claim fl68|) follows. 
■ 

Recall that Ok '■= j-i>k,o G 9^ Because lim fc A& = A > the previous also gives 
lirrifc vol ({(Tfe > e}) = for all e > 0. Let us fix e > for now and let V e , Wk j£ C W 1 be 
disjoint (measurable) sets with vol(V^) = v °^d$ and vol(W& )E ) = vol({o"fc > e}). Let us set 
r k := el Ve + l Wk e . Then / cr fc l {f7fc > a} < / rjfcl^^} for all a (using a fe < 1 and LemmaES]), 



so that parts (vi) , (ii) and (iii) of Lemma 12 . 1 1 give 



= ec(vol(K),t) + c(vol(W fcl6 ),0- 



Now a; = c(w,t)w solves H(x) = l/(wt), so x — >■ 1 as w — > 0; and thus c(u>,t) ~ w 

ro\(B 
2 d 5 



as w — > 0. Hence ec(vol(V r e ), £) ~ evol(V^) = v °'i? as e -> 0; and for any fixed £ > 



c(vol(W4 j£ ),t) -> as -> oo. 

It follows that limsup i ._ >0O £((7fc,f) < ■ Since er£. := jzx"^k,i G ^ by fl67|) . we thus 
have 

lim f(^ fc ,t) = lim ^(A fc a fc + (1 - A fc )4,t) < A V °^f - + (1 - A)sup£(y2,t) < sup £(</>,£), 

k^oo fc->oo ^"0 ^gj ^gj 
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using parts (i) and (ii) of Lemma I2TT1 But this contradicts equation (|65|) : so we must 
have Afc — > 0, completing the proof of Lemma 16.71 ■ 



Lemma 6.8 Assume that S < 1 (so < t < oo). Then the function f x /uj{t) is continuous 
and strictly increasing for to < t < oo . 



Proof: That f x /u(t) is continuous follows immediately from parts (iii) of Theorems 11.11 
and 11.21 which we have already established in sections H] and [5j Let us observe that, by 
definition f|4*8|) of to, it suffices to show that whenever t > is such that f x / ul {t) > 1 then 
f x /u(t') > f x/uJ (t) for all t' > t. 

Consider a t > t with f x /u(t) > 1. First suppose that sup^ggr^y?, t) = v ■ Notice 
that the lower bound in Lemma 15^41 then shows that also sup^gj t') = for all 

t' > t, so that in this case f x / u {t') > f x / w {t) for all t' > t as £(y?o 5 £) is strictly decreasing 
in t. 

So we may assume that sup^g^ t) > v ° . By Lemma IBTTl there is a (p G 
with < j < oo such that the supremum equals £((p,t). Next, we claim that would 
suffice to prove that for any A > 1 there is at most one t! > that solves the equation 
£(tp,t') = \£,(ipo,t'). This is because, since £(y?,to) < ^(^0,^0) an d t' H- £,(<p,t')/£,((po,t') is 
continuous in t', we would then also get that 

for all t' > t. 



Set ifj := \<p with A > 1. By Lemma [2TTI part (ii) , €(ip,t) = \£((p ,t). So to prove the 



lemma, it suffices to show that the system of equations 

<p(x)e Wip{x) dx = [ ^(x)e s ^ {x) dx, (69) 



f H(e w ^)dx= [ H(e s ^ {x) )dx (70) 

JR d JR d 

has at most one solution (w, s) with w, s > 0. For s > let v (s) be the unique solution 
of fl69l and let u(s) be the unique non-negative solution of flTOj) . Let us write 



F(s) := v(s) — u(s). 

Our goal for the remainder of the proof will be to show that there is at most one solution 
s > of F(s) = 0, which implies that there is at most one solution (w, s) with w, s > of 
the system given by fl69|) and flTDl and thus also implies the lemma. 
Differentiating both sides of equation ( |69l) wrt s we get 

/ v'(s)ip 2 (x)e v{s)ip{x) dx = [ *p 2 (x)e s ^ x) dx, 
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where we have swapped integration wrt x and differentiation wrt s. (This can be justified 
using the fundamental theorem of calculus and Fubini's theorem for nonnegative functions; 
and that v is different iable can be justified using the implicit function theorem - for more 
details see the footnotes in section El) This gives 

Similarly, differentiating ( I7U|) wrt s we get that for s > 0: 

sf Rd ^(x)e^dx 
{) u(s)J Rd ^(x)e^M*)dx K ' 

Observe that from ( 1711) we get 

w J Rd if 2 {x)e v ^ x )dx ~ j Rd if{x)e v ^ x )dx v ' 

where we have used the specific form of ip as a constant times an indicator function, the 
fact that <^> 2 {x) < tp(x) (as < tp(x) < 1 for all x) and the fact that v(s) solves ( 1691) . 
From (17111 and ( 1721) we see that: 

If u(s) = v(s) for some s > then u'(s) = —-^-v'(s). (74) 

Let us first suppose that v (0) > 0. Since u'(s) > A > 1 for all s > 0, it follows that v(s) > s 
for all s > 0. Thus, from ( 1741) we see that whenever i>(s) = u(s) we have ^'( s ) < v '( s )- ^ n 
other words, at every zero of F we have F'(s) = v'(s) — u'(s) > 0. This shows F can have 
at most one zero, as required. 

It remains to consider the case when v(0) < 0. We may assume that there is a solution 
s > to v(s) = u(s), and since F is continuous there is a least such solution s\. Now u(0) = 
so F(0) < 0, and thus F(s) < for each < s < Sl . Hence F'( Sl ) = u'(si) - m'( s i) > 0. 
From (174]) we now see that v(si) > Si, so that using (175]) we have u(s) > s for all s > si. 
Reasoning as before this gives that: 

If F(s 2 ) = for some s 2 > s x then F'(s 2 ) > 0. (75) 

Hence, if F(s) > for some s > s\ then -F(s') > for all s' > s: for if not then there is a 
least s' > s such that F(s') = 0. Such an s' must then satisfy F'(s') < 0, contradicting ( 1751) . 

Now suppose there is an s 2 > si such that F(s2) = 0. By the above we must then have 
F(s) < for all si < s < s 2 . Hence, for si < s < s 2 we have 

t/(a) = n( g ) J Rd ^ 2 (x)e^M-)dx > «W j 
s / Rd v 92 (^)e ,,(sMa;) dx ~ s 

(using that u(s) > v(s) and v(s) > s). Thus F'(s) > for all Si < s < s 2 . But this implies 
F(s 2 ) > F(si) = 0, contradicting the choice of s 2 . 
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It follows that s± is the only zero of F, which concludes the proof of the lemma. ■ 
Lemmas 16.61 and 16.81 give Theorem 11.31 



7 Remaining proofs 



There are still some loose ends. Here we will finish the proof of Theorems 11.11 11.21 11.41 
and 11.71 and Proposition 11.61 



7.1 Proof of parts (ii) of Theorems 11.11 and 11.21 



The following Lemma will immediately imply parts (ii) of Theorems 11.11 and ll.2( and it 



is also convenient for the proof of Theorem 11.41 later on. Recall that the notation A n 
a.a.a.s. means that P(A n holds for all but finitely many n) = 1. 

Lemma 7.1 For every e > 0, there exists a (3 = /3(a,s) such that if n^ 13 < nr d < (3\nn 
for all sufficiently large n, then 

(1 — e)k(n) < u(G n ), x(G n ) < (1 + e)k(n) a.a.a.s., 
where k{n) := Inn/ In 

Proof: Set Wi := 5(0; \) and W 2 := cl(B), and observe that 

M Wl < u(G n ) < x(G n ) < A(G n ) + 1 < M W2 , (76) 

where A denotes the maximum degree. Now set (3 := min(/3(W / i, a, e), f3(W2, c, e)) with 
f3(W, a, e) as in Lemma [3.91 The statement immediately follows from Lemma [3.91 together 
with flZOD. ■ 



7.2 Proof of parts (i) of Theorems 11.11 and 11.21 and of Proposi- 
tion 11.61 

Our plan is to use Lemma [3751 on the generalised scan statistic. To make use of this lemma, 
we 'split r into parallel sequences'. Let if € N be such that < a. For k = 0,1, ... ,K 
set cifc := t~~t and set: 



r k [n) := 

for 1 < k < K and for k = set: 



r(n) if n afe ~ 1 < nr < n ak , 
n~K-i+i)/d otherwise 



r (n) :-- 



r{n) if nr d < n a °, 
n~ 4 l d otherwise 
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Observe that n k 2 < nrf < n k+ ? for k — 1, . . . , K and nr^ < n 2 . Hence 



rin) = rfc(n) if and only if k 



Inn I 1 1 
ln(nr d ) I ~* 2 



(77) 



Let us now put Gn '■= G(Xi, . . . ,X n ;rk(n)) for 1 < k < K. Because G n coincides with 
some Gn for each n, and the intersection of finitely many events of probability one has 
probability one itself, it suffices to show that x{G n ) = oo(Gn) G {k, k + 1} a.a.a.s. for 
each k separately. 

Let us thus suppose that there is a fixed < k < K such that r(n) = rk(n) for all n. 
Set Wi := 5(0; \) and W 2 := B(0; 100). Then we again have 

M Wl < cu(G n ) < X (G n ) < A(G n ) + 1 < M W2 . (78) 
(The reason for choosing radius larger than 1 will become clear shortly.) Now notice that 



part (i) of Lemma [3.81 shows that 

M W2 <k+l. (79) 



If k G {0,1} then M\y x > k holds trivially, and for k > 2 we can apply part (ii) of 
Lemma [3.81 to show that cL. cL. cL. S. '. 

M Wl > k. (80) 



Putting ( 1781) . ( 1791) and ( IHUl) together, we see that we have just proved parts (i) of Theo- 
rems 11.11 and 11.21 

To finish the proof we will derive (deterministically) that if (179]) and ( 180|) hold then 
x(G n ) = u{G n ) must also hold. So let us assume that (1791 and (IHOj) hold. First note 
that A(G n ) G {uj{G n ) — l,u(G n )}. If A(G n ) = w(G fi ) — 1 then we are done (since always 
x{G) < A(G) + 1), so let us suppose that A(G n ) = u(G n ). In this case Brooks' Theorem 
(see for example van Lint and Wilson [20]) tells us that x(G n ) = u(G„) unless u(G n ) = 2 
and G n contains an odd cycle of length at least 5. Let us therefore assume that oo(G n ) = 2. 
Then we must have k < 2 and hence Mw 2 < 3. But now each component of G n must have 
at most 3 vertices: to see this note that if the subgraph induced by X^, Xi 2 , Xi 3 , is 
connected then all four points are contained in a ball of radius < lOOr. Hence there are 
no odd cycles of length at least 5 and so x(G n ) = 2 as required. 

7.3 Proof of Theorem [TBI 

Let r be any sequence of positive numbers that tends to 0, and let e > be arbitrary, but 
fixed. Our aim will be to prove 

1_ £ < < i + £ a.a.a.s., (81) 

which clearly implies the theorem. 
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Set e' := j^- — 1, and let < f3 < (3(a, e') where f3(a, e') is as in Lemma [7TT1 above. We 
can assume without loss of generality that (3 < to/a with to as in Theorem 11.31 if 5 < 1. (If 
5=1 then we do not need any additional assumption.) We will apply the same trick we 
used in the previous section. Set ri(n) := min(r(n), n~^ +1 ^ d ), 



r 2 (n) :-- 




if n 13 < nr d < (3 In n 
otherwise. 



and r 3 {n) := max((^) d , r(n)). Now set G#° := G{X 1 , . . . , X n ;r k {n)) for k = 1,2,3. It 
again suffices to prove ( IHTj) for each Gn separately. 

Notice that — > as n — > oo, so that f x /^(~^) 1- The statement (18T|) for 
now follows immediately from Proposition 11.61 

Since liminf^^^- > 0, the statement flHT]) for Gn 3 ' ) follows immediately from Theo- 
rems 15.111 and 14.11 



„d 



Finally, notice that < to for all n, so that in fact / X / £J (^- 2 -) = 1 for all n. Applying 
Lemma \7. II we see that 

Y(d^) 1+5' 

1 < 77^- < = 1 + 5 a.a.a.s. 

" cu(G^) 1-^ 

This concludes the proof of Theorem 11.41 



7.4 Proof of Theorem [TT7 



When 5 = 1 then the theorem follows directly from u(G„) < < ^(^n) together 



with part (i) of Theorem 11.31 and Theorem 11.41 

Let us thus suppose that 5 < 1. We can again "split into subsequences" as in sections [7721 
and 17.31 to see that it suffices to prove the theorem for the case when anr d < to In n and 
the case when anr d > to In n. 

If anr d < to Inn then f x / UJ ( £ ^-) = 1 so that the argument for the 5 = 1 case applies. 

Finally, if anr d > to Inn then in particular liminf y— > 0; and now we may use (HTI) to 
complete the proof. 



8 Concluding remarks 

In this paper we have proved a number of almost sure convergence results on the chromatic 
number of the random geometric graph and we have investigated its relation to the clique 
number. Amongst other things we have set out to describe the "phase change" regime 
when nr d = O(lnn). An important shift in the behaviour of the chromatic number occurs 
in this range of r (except in the less interesting case when the packing constant 5=1). We 
have seen that (except when 5 = 1) there exists a finite positive constant to such that if 
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anr d < to In n then the chromatic number and the clique number of the random geometric 
graph are essentially equal in the sense that 

ui{G n ) 

and if on the other hand anr d > (t + e)lnn for some fixed (but arbitrarily small) e > 
then the lim inf of this ratio is bounded away from 1 almost surely. Moreover, if nr d ^ In n 
then 

X(Gn) , 1 

where 5 is the packing constant. 

We have also given expressions for the almost sure limit f x (t) of X ^"J and the almost 
sure limit f x /u,(t) of if anr d ~ thin for some t > 0. An interesting observation is 

that to an d the limiting functions f u (t), f x (t) and L u (t) do not depend on the choice of 
probability measure u, and that the only feature of the probability measure that plays any 
role in the proofs and results in this paper is the maximum density a. 

It should be mentioned that considering the ratio ~(§N , apart from the fact that it 



provides an easy to state summary of the results, can also be motivated by the fact that 
while colouring unit disk graphs (non-random geometric graphs when d = 2 and |.| is 
the Euclidean norm) is NP-hard (Clark et al [3], Graf et al [5]), their clique number may 
be found in polynomial time [3], unlike finding the clique number in general graphs. In 
fact the clique number of a unit disk graph may be found in polynomial time even if 
an embedding (that is, an explicit representation with points on the plane) is not given 
(Raghavan and Spinrad [17]). Thus, the results given here suggest that even though finding 
the chromatic number of a unit disk graph G is NP-hard, for graphs that are not very sparse 
the polynomial approximation of finding the clique number and multiplying this by | might 

work quite well in practice. Note that for the Euclidean norm in the plane 4 = ~ 1.103: 
also in this case always x(G)/oo(G) < 3 (Peeters [T5]). 

It is instructive to consider for comparison the ratio of chromatic number to clique 
number in the Erdos-Renyi model, with expected degree similar to the values we have 
been investigating. Let us consider p = p(n) such that np — > oo with np = o(ns) as 
n — > oo. Then u(G(n,p)) = 3 whp (see for example Bollobas [?] Theorem 4.13), and 
x(G(n,p)) ~ w hp (Luczak [9], or see [?] Theorem 11.29); and thus whp 

x(G(n,p)) np 

~77y v\ ~ «1 y 00 wh P • 

uj(G(n,p)) bmnp 

Also, when does x{G{n,p)) = u(G(n,p)) whp? This property holds if np — > as n — > oo, 
since G(n,p) is then a forest whp; and conversely, if the property holds (and p is bounded 
below one) then np — > 0. Thus, the results on the chromatic number given here, and in 
the earlier work of the first author [12] and Penrose [TB], highlight a dramatic difference 
between the Erdos-Renyi model on the one hand and the random geometric model on the 
other hand. 
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Although we have presented substantial progress on the current state of knowledge on 
the chromatic number of random geometric graphs in this paper, several questions remain. 
Our proofs for instance do not yield an explicit expression for to (when 5 < 1) and it would 
certainly be of interest to find such an expression or to give some (numerical) procedure 
to determine it, in particular for the euclidean norm in M?. More generally, it is far from 
trivial to extract information from the expressions for f x {t) and f x / w {t). In particular we 
would be interested to know whether f x is differentiable. In particular: is f x differentiable 
at t ? Also Lemma IBTTl suggests the question: Is f x (t) > ^'if for all < t < oo? 

Another natural question concerns the extent to which the chromatic number is 'local' 
or 'global'. Define the random variable R n to be the infimum of the values R > such 
that G n contains a subgraph H n which is induced by the points in some ball of radius R 
and which satisfies x(H n ) = x(G n ). In the very sparse case when nr d < n~ a for some 
fixed a > 0, the proof of Proposition 1.6 shows that R n < ST cL.cL.cL.S. for any fixed e > 0, 
so the behaviour is 'very local'. Now suppose that anr d ~ thxn for some t > 0. Then by 
Lemmas 16.11 and I6.2[ for t sufficiently small we have R < 2r cL.cL.cL.S. SO still the behaviour 
is local. But what happens for large tl 

A question that has not been addressed in this paper (except in the "very sparse" case 
when nr d is bounded by a negative power of n) is the probability distribution of x{G n )- 
In a recent paper by the second author [13] it was shown that whenever nr d <C Inn then 
x(Gn) is two-point concentrated, in the sense that 

¥( X (G n )e{k(n),k(n) + l})^l, 

as n — > oo for some sequence k(n). Analogous results were also shown to hold for the 
clique number, the maximum degree and the degeneracy of G n . For other choices of r the 
distributions of these random variables are not known. However, it is possible to extend 
an argument in Penrose [16] to show that if v is the uniform distribution on the hypercube 
and Inn <C nr 2 <C (lnn) d then there are (a n ) n and {b n ) n such that (A(G n ) — a n )/b n tends 
to a Gumbel distribution, and if nr d = 0(lnn) then A(G n ) is not finitely concentrated, 
but the distribution does not look like any of the standard probability distributions. 
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A Proof of Lemma 16.41 of Boroczky 



Lemma [6.41 above is due to K. Boroczky Jr. and with his kind permission we give a proof 
here, as it is not readily available from other sources. 

Proof of Lemma 16. 41 Let / := fl a6y !i(a + C). Then / is compact and convex. We may 
suppose wlog that vol(J) > (as otherwise there is nothing to prove). Let us remark that 

/ + cl(-A) C C. 

This is because for any x G / and a G A there exists a c G C such that x = c + a, by 
definition of /. Thus, for any x G /, a G A we have x — a G C: in other words I+(—A) C C . 
This also gives that / + cl(— A) = cl(J + (—A)) C C as C is closed. 

We now use the Brunn- Minkowski inequality (see chapter 12 of Matousek [11] for a 
very readable proof). This states that, if A, B C M. d are nonempty and compact, then 

vo\(A + B) > \vo\{A)^ + vol(-B)5j . By this inequality 

vol(/)3 + vol(cl(-A))^ < vol(J + cl(-A))3 < vol(C)i 

Thus 

vol(J) < (vol(C)^ - vo\{A)*\ . (82) 

The lemma will now follow if we show that equality holds in ( |82l) when A is of the form 
A = X(—C) for some A > 0. Let us then suppose that A = X(—C) for some < A < 1 
(note that A > 1 would contradict vol(J) > 0). 

We claim that (1 — X)C C I. Let x G (1 — X)C, and let a G A = A(— C) be arbitrary. 
We can write x — (1 — A)ci and a = — Ac2 for some c\,C2 G C. Because C is convex, 
C3 := (1 — A)ci + AC2 G C and thus x = a + C3 G (a + C). As a G A was arbitrary this 
gives x G I. Thus indeed (1 — X)C C J, as claimed. But now vol(J) > (1 — A) d vol(C) and 
vol(A) = A d vol(C)); and so equality holds in (1H2"|) . and we are done. ■ 
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